CJ-  *£  os-/ 3  :  ay^y-  /-  ^// 


Series  ISP  2,  No.  1-e 

Preliminary  Copy  —  May  1969 


AGROSTAN 

A  Case  Study  for  the  1970  World  Census  of  Agriculture 


UNIT  IV.    ENUMERATION  PLAN  AND  SAMPLE 

DESIGN 


Prepared  under  the  auspices  of  the  Agency  for  International  Development, 
U.S.  Department  of  State,  as  a  contribution  to  the  War  on  Hunger  Program. 


U.S.  DEPARTMENT  OF  COMMERCE    Maurice  H.  Stans,  Secretary 

Rocco  C.  Siciliano,  Under  Secretary 

William  H.  Chartener,  Assistant  Secretary  for  Economic  Affairs 

BUREAU  OF  THE  CENSUS     A.  Ross  Eckler,  Director 


AGROSTAN:    A  Case  Study  for  the 
1970  World  Census  of  Agriculture 


Series 
ISP  2,  No 

1- 

■a 

1- 

-b 

J- 

-c 

1 

-d 

1- 

-e 

1 

-f 

1-g 

1-h 

1-i 
l-j 

1-k 
l-l 

1-m 
1-n 
l-o 


Introduction  to  the  1970  Agriculture  Census 
Case  Study 

Unit  I.    Description  of  Country  and  Its 
Census  Objectives 

Unit  II.    Planning  and  Administrative 
Requirements 

Unit  III.    Content,  Scope,  and  Design  of  the 
Census 

Unit  IV.    Enumeration  Plan  and  Sample 
Design 

Unit  V.  Data-Collection  Procedures, 
Part  A  (Organization,  Training,  and 
Control) 

Unit  V.  Data-Collection  Procedures, 
Part  B  (Instruction  Manuals) 

Unit  VI.    Distribution,  Receipt,  and  Control 
of  Census  Materials 

Unit  VII.    Editing  and  Coding 

Unit  VIII.    Tabulation  Processes 

Unit  IX.    Review  and  Publication  of  Data 

Unit  X.    Current  Agricultural  Surveys  and 
Studies 

Principal  Data-Collection  and  Control  Forms 

Workbook  for  the  Case  Study 

Short  Course  in  Agricultural  Sampling  and 
Agricultural  Economics 


BUREAU  OF  THE  CENSUS 
A.  Ross  Eckler,  Director 

Robert  F.  Drury,  Deputy  Director 


CHARLES  B.  LAWRENCE,  Jr.,    Assistant  Director 
for  International  Statistical  Programs 


International  Statistical  Training  and  Workshop  Office 
Benjamin  Gura,  Chief 


ACKNOWLEDGMENTS-The  Agrostan  case  study  was  developed  by  the  Inter- 
national Statistical  Training  and  Workshop  Office,  U.S.Bureau  of  the  Census, 
under  the  direction  of  Beulah  Washabaugh,  Chief,  Statistical  Workshop  Branch. 
A  number  of  persons  contributed  to  the  concept  of  Agrostan  and  to  the  devel- 
opment of  the  materials  which  synthesize  job  knowledge  and  experience  in  a 
comprehensive  but  related  range  of  statistical  activities.  Major  contributions 
were  made  by  staff  members  of  the  U.S.  Bureau  of  the  Census,  the  U.S.  De- 
partment of  Agriculture,  and  the  Food  and  Agriculture  Organization  of  the 
United  Nations;  the  cooperation  of  these  persons  is  gratefully  acknowledged. 

Unit  IV  was  prepared  by  Floyd  E.  O'Quinn,  Mathematical  Statistician,  and 
Dr.  Catherine  S.  Manno,  Mathematical  Statistician,  International  Statistical 
Programs,  U.S.  Bureau  of  the  Census. 


CONTENTS 


Chapter  IV- 1.      DECISIONS   AFFECTING    DESIGN   OF    THE    CENSUS 


Page 

1 .  Introduction 1 

2.  Types   and  reliability  of  statistics   required 1 

2.  1       Types   of  statistics   required 1 

2.  2      Level  of  publication 1 

2.  3      Level  of  reliability 2 

2.  31     Provinces 2 

2.  32     Northern  Territory 2 


■I. 


Page 

Sample  or  complete  census 3 

3.  1      Advantages  of  a  complete  census 3 

3.  2      Advantages  of  a  sample  census 3 

3.  3      Decision  to  take  a  sample  census 3 

3.  4      Effects  of  nonsampling  errors 4 

Basic  principles  in  sample  design 4 


Chapter  IV-2.     SAMPLE    DESIGN   AND   ENUMERATION  PLAN   FOR    THE    PROVINCES 


Use  of  resources  available  from  previous 

censuses 

1.  1       Description  of  the   1961  Sample  Census  of 
Agriculture 

1.11  Level  of  publication  and  reliability. 

1.12  Sample  design 

1.13  Enume  ration  plan 

1.2      Use  of  the   1961   agricultural  data 

1.  3      Use  of  the   1970  Population  Census 

1.  4      Preliminary  1970  data 


Outline  of  the  1971   sample  design 

2.  1       Definition  of  the  universe 

2.  2       Reliability  required 

2.  3      Total  sample  size  and  workloads 

2.4  Sample  units,  sampling  rates,  and  methods 
of  selection 

2.  41     National  Certainty  holdings 

2.  42     The  first-stage  unit  (PSU) 

2.  43     Local  Certainty  holdings 

2.  44  The  second-stage  unit  (small  holding) 
2.  5      Weights 


Outline  of  the  enumeration  plan  .  .  .  . 
3.  1      Listing 

Preparing  the  sampling  frame 

Sample  selection 

Interviewing 

Supervision 


3.  2 
3.  3 
3.  4 
3.  5 


Reasons  for  various  features  of  the  sample 
design 


6. 


4.  1      Two-stage  sample 

4.  2       PSU's   of  nearly  equal  size 

4.  3      Stratification  of  PSU's 

4.  4      National  Certainty  holdings 

4.  5      Local  Certainty  holdings 

4.  6  Stratification  of  holdings  within  sample 

PSU's 

First  steps   in  calculating  the  number  of  sample 
holdings  needed 

5.  1       Size  of  sample   required  to  estimate   propor- 

tions with  a  simple  random  sample 

5.  2      Effect  of  clustering  on  sample   size 

5.  3       Size  of  sample   required  to  estimate 

averages  and  aggregates 


5.  6 
5.  7 


Effect  of  special  features  in  improving  the 
efficiency  of  the  sample 

Effect  of  a  constant  overall  sampling  rate 
on  district  sample   size 

Sample  size  for  cross-classifications  .... 

Summary 

and 


Determination  of  the  number  of  first- 
second-stage  sample  units  needed 

6.  1       Types  of  census   costs 

6.  1 1     Overhead  costs 

6.  12    PSU  cost 

6.  1  3     Sample  holding  costs 

6.  2       Determining  the  cost  function 

6.  3      Determining  the  number  of  PSU's  and  the 
average  size  of  subsample 


7.      Remarks  on  sample  design. 


I  0 
10 
10 

1 1) 

1  0 

10 


11 

11 
11 

13 

H 

14 
15 
15 


15 
16 
16 
16 
16 
16 

17 

17 


Chapter  IV- 3.     SELECTING   THE    SAMPLE   OF    FIRST-STAGE    UNITS 


1.      Introduction. 


2.      National  Certainty  holdings 

2.  1       Definition 

2.  2      Sources  of  data , 

2.  21     List  of  1961  large  holdings  .  .  . 

2.  22     The  Ministry  of  Agriculture  .  .  . 

2.  23    Local  officials 

2.  24    Associations  and  cooperatives  , 
2.  3       Small  holdings   erroneously  included  . 


2.4      Large  holdings  found  after   1971  lists  are 

compiled 19 

Sample  of  PSU's 20 

3.  1       Forming  the  PSU's 20 

3.  11     Rural  PSU's 21 

3.  12    Measures  of  size 21 

3.13     Urban  PSU's 21 

3.  14    PSU  card 22 

3.  2      Stratification  and  listing  of  the  PSU's 22 

3.  3      Selecting    sample  PSU's 23 


Chapter  IV-4.      LISTING   WITHIN   THE    SAMPLE 

1 .  Introduction 2  5 

2.  Listing 25 

2.  1       Relation  between  housing  units,    holders, 

and  holdings 25 

2.  2      Definitions  of  holder  and  holding 26 

2.  3      Out-of-scope  holdings 26 

2.  4      Listing  problems 26 


PSU   AND   SELECTION   OF    THE    SAMPLE   HOLDINGS 


Preparing  the   sampling  frame 27 

Selecting  the   sample  holdings 27 

4.  1       Local  Certainty  holdings 27 

4.  2      Small  holdings 28 

4.  3      PSU's  with  a  very  large  number  of  small 

holdings 28 


CONTENTS 


Chapter  IV-5.      SAMPLING   IN   THE    NORTHERN   TERRITORY 


Page 

Introduction 29 

Census  objectives  and  problems 29 

2.  1      Census  objectives 29 

2.11     Estimates  for  zones 29 

2.  12    Estimates  for  subgroups ■ 29 

2.13    Accuracy  of  the  estimates 29 

2.  2      Census  problems 30 

2.  21     Mapping 30 

2.  ZZ    Response  error 30 

2.  23    Polygamy 30 

2.  24    Between-PSU  travel 31 

2.  25    Cultivation  practices 31 

Sample  design 31 

3.  1       Considerations   affecting  the   sample   design.  ...  31 

3.  11     Definition  of  the  universe  and  estimated 

total  number  of  holdings 31 

3.  12     Definition  of  PSU's  and  estimated  num- 
bers by  population  size 32 

3.  13    Estimated  total  sample  size  needed 32 


Page 

3.  2      Basic  two-stage  sample  plan 32 

3.  3      Three-stage  sample  for  area  measurements.  .  .  33 

3.  4      National  Certainty  holdings 33 

4.  Selecting  the  sample  of  small  holdings 33 

4.  1      Listing  and  stratifying  PSU's 33 

4.  2      Selecting  the  PSU's 34 

4.  3      Listing  and  stratifying  holdings 35 

4.  4      Selecting  the  holdings  within  sample  PSU's  ....  35 

4.  5      Selecting  the  subsample  for  area  measure- 

ment    35 

5.  Enumeration  plan 35 

5.  1      Interviewing  and  supervision 36 

5.  2      Objective  measurement  of  area 36 

5.  21     Making  the  sketch  of  cropland  in  the 

holding 36 

5.  22     Determining  the  area 37 

6.  Preparation  of  estimates 37 

7.  Summary 37 


Chapter  IV-6.     ESTIMATES  OF   CHARACTERISTICS   FROM   THE    SAMPLE 


Introduction. 


Noninterview  adjustment 

2.  1      Types  of  noninterviews 

2.  2      Adjustment  for  noninterviewed  small 

holdings 

2.  3      Adjustment  for  noninterviewed  certainty 

holdings 


Adjustment  for  missing  area  measurements 
Identification  of  duplicate  cards 


38 
38 

38 

39 

39 

39 


5.  Alternative  adjustment  procedures 

6.  Possible  biases  in  adjustment  procedures 


Estimates  from  the  sample 

7.  1       Estimates  of  numbers  of  holdings   and  of 

aggregates 

7.  2      Estimates  of  ratios,    percentages,    averages 

and  proportions 

7.  3      Ratio  adjustment  of  estimates  of  totals  and 

aggregates 


8.      Use  of  sampling  errors  v/ith  the  estimates 


39 
40 

40 

40 

41 

41 
42 


Chapter  IV-7.     ESTIMATES  OF   SAMPLING   ERROR 


1 .  Introduction 4  3 

2.  Variance  of  a  sample  estimator 43 

2.  1      Variance  of  an  estimate  from  a  simple 

random  sample 43 

2.11     Definition  of  the  population  variance 4  3 

2.  12     Variance  for  a  sample  of  n  holdings 44 

2.  2      Variance  for  the  Agrostan  sample  design 45 

2.  21     Within-PSU  variance 45 

2.  22     Between-PSU  variance 45 

2.  23    Total  variance 45 

2.  3      Relative  variance  and  coefficient  of 

variation 45 

3.  Estimate  of  total  variance  from  the  sample 45 

3.  1      Effect  of  systematic  selection 45 

3.  2      The  grouped  stratum  method 46 

3.  3      Estimates  of  variance  for  districts 47 


3.  4      Estimates  of  variances  for  large  provinces.  ...  47 

3.  5      Estimates  of  variances  for  the  nation 47 

Estimate  of  components  of  variance  from 

sample 48 

4.  1      Between-PSU  variance 48 

4.  2      Within-PSU  variance 48 

4.  3      Remarks 48 

4.  4      Using  components  of  variance 48 

Variance  of  the  estimate  of  a  ratio  and  of  a  ratio 

estimator 49 

5.  1      Population  variance  of  a  ratio  estimator 49 

5.  2      Sample  estimate  of  the  variance  of  a  ratio 

estimator 49 

5.  3      Estimate  of  a  ratio  or  proportion 49 

Illustration  of  variance  computation 50 


Chapter  IV-8.     ALTERNATIVE    SAMPLE    DESIGNS 


Introduction 

Modifications  related  to  Census  objectives 

2.  1       Smaller  publication  areas  or  greater 

accuracy  

2.  2  Larger  publication  areas  or  less  accuracy 
2.  3  Different  requirements  for  different  items 
Modifications  related  to  size  of  country 


4.2 

4.  3 


4.4 


Modifications  related  to  availability  of  resources.  .  . 
4.  1      No  previous  agriculture  census 

No  previous  complete  population  census 

Simultaneous  sample  censuses  of  population 
and  agriculture 

More  detailed  maps  available 

4.41  Single-stage  segments  with  single 
measures  of  size 

4.42  Single-stage  segments  with  varying 
measures  of  size 

4.  43    Segments  as  second-stage  units 

4.  44    Use  of  cadastral  survey  maps 


51 
52 

52 
52 
53 
53 

54 
54 
54 

54 
55 

55 

55 
56 
56 


4.  5 
4.  6 

4.  7 


4.  8 


Less  detailed  maps  available  .  . 
More  money  and  staff  available 


56 
57 


Use  of  interpenetrating  subsamples 57 

4.  71     Successive  estimates 57 

4.  72    Evaluation  of  sampling  error 58 

4.  73    Evaluation  of  "enumerator  effect" 58 

Use  of  first-stage  ratio  estimates 59 


5.  Other  considerations  affecting  sample  design. 
5.  1  Number  and  definition  of  weight  groups.  . 
5.  2      Holdings  operated  by  city  residents 


5.  3 
5.4 


5.  5 
5.  6 


Communal  forms  of  tenure  .  . 

Use  of  "villages"  or  "barrios' 

of  PSU's 


as  a  frame 


Sampling  with  probability  proportional    to 

size 

Complete  agriculture  census  to  be  taken  .  . 


60 
60 

60 

60 
61 

61 
62 


CONTENTS 


Chapter  IV-9.      OTHER  USES  OF  SAMPLING 


Page 

1 .  Introduction 63 

2.  Sampling  in  pretesting 63 

Z.  1     Pretesting  questionnaires  and  other  forms 63 

2.  2     Pretesting  time  and  cost 64 

2.  3    Pretesting  office  procedures 64 

2.  31     Check-in  procedures 64 

2.  32    Office  processing 64 

2.  4    Experimental  census 64 

3.  Sampling  in  quality  control  operations 65 

3.  1     Quality  control  in  pre-enumeration 

activities 65 

3.  1  1     Mapping 65 

3.  12     Format  of  questionnaires  and  listing 

books 65 


Page 

3.  13     Printing 65 

3.  14    Review  of  training  materials  and 

manuals 6  5 

3.  15     Distribution  of  materials 65 

3.  2    Quality  control  in  field  enumeration 66 

3.  21     Checking  PSU  boundaries 66 

3.  22    Observation  of  enumeration 66 

3.  3     Quality  control  of  office  processing 66 

3.  31     Editing,    coding,    keypunching, 

tabulating 66 

3.  32    Control  of  materials 66 

3.  33    Review  of  data 66 

Sampling  in  tabulations 44 

Sampling  for  an  evaluation  survey 67 

Sampling  for  intercensal  surveys 67 


Chapter  IV- 10.      SUMMARY  OF  ACTIVITIES 


1.     Introduction. 


1.  1 


General  responsibilities  of  the  sampling 
technician 


Page 
68 


(,H 


1.  2 


1.  3 


Page 
Summary  of  specific   sampling  procedures  in 

Agrostan 69 

1.  21     Provinces 69 

1.  22     Northern  Territory 69 

Flow  chart  of  sampling  activities 70 


REFERENCES 
Selected  list  of  references 71 

APPENDIX 


Exhib 
IV-2- 


IV-2-2. 


IV-2-3. 


IV-2-4. 


IV-2-5. 


IV-2- 
IV-2- 
IV-2- 


IV-3-1. 


IV-3-2. 


IV-3-3. 


IV-3- 
IV-3- 


IV-3-6. 


IV-3 
IV-3 


-7. 


Worksheet,    Form  P-270:     Preliminary  count 
from  the   1970  Census  of  Population  for 
Ranchi  District  in  Rajpur  Province 

Preliminary  population  count:     Rural  enu- 
meration areas  by  size  groups,    for 
districts  in  Rajpur  Province:     1970 

Preliminary  population  count:     Urban  enu- 
meration areas  by  size  groups,    for 
districts  in  Rajpur  Province:     1970 

Preliminary  population  count:  Rural  enu- 
meration areas  by  size  groups,  for  the 
Provinces:     1970 

Preliminary  population  count:  Urban  enu- 
meration areas  by  size  groups,  for  the 
Provinces:     1970 

Estimated  sample  size,    for  the  Provinces.  .  .  . 

Estimated  sample  size,    for  Rajpur  Province  . 

Simple  method  for  optimizing  sample  design.  . 


Page 


7s 


76 


76 


77 
78 
79 
8  0 


IV-4-1. 


List  of  1961  large  holdings  (Provinces), 
Form  A -3,    for  Ranchi  District  in  Rajpur 
Province 82 

1971  National  Certainty  Listing  (Provinces), 
Form  A -2c,  for  Ranchi  District  in  Rajpur 
Province 84 

Preliminary  count:     Number  and  population  of 
urban  places,    by  size  of  place,    for  the 
Provinces:     1970 

PSU  card  (Provinces),    Form  A-260 88 

Map  of  Cordoba  city   in  Warangol  District, 

Rajpur  Province 89 

Map  of  Ranchi  District  in  Rajpur  Province 
showing  enumeration  areas  for  the  1970 
Census  of  Population 9  0 

District  summary- -PSU 's  in  sample, 

Form  A-252,    for  Rajpur  Province 91 

PSU  Sample  Selection  Sheet  (Provinces), 
Form  A-261  for  Ranchi  District  in 
Rajpur  Province 92 

Listing  Book  (Provinces),    Form  A-2a,    for 

PSU  19  in  Ranchi  District,  Rajpur  Province.  .        93 


Exhibit  Page 

IV-5-1.      Preliminary  count:     Chiefdoms  by  size  groups, 

for  the  Northern  Territory:     1970 106 

IV-5-2.     Preliminary  population  count:     Special  enu- 
meration areas  by  size  groups,    for  the 
Northern  Territory:     1970 106 

IV-5-3.     Preliminary  population  count:     Urban  enumera- 
tion areas  by  size  groups,    for  the  Northern 
Territory:     1970 106 

IV-5-4.     Estimated  sample  size  for  the  Northern 

Territory 1  07 

IV-5-5.     Enumerator's  map  sketch  of  Damascus 

Chiefdom  in  the  Northern  Territory 108 

IV-5-6.      Listing  Book  (Northern  Territory),    Form  A -2b, 
for  Damascus  Chiefdom  in  the  Northern 
Territory 1  09 

IV-7-1.      Worksheet:     Computation  of  variances  -  - 

grouped-stratum  method,    Form  A-281 117 

IV-  1  0-  1 .  Flow  chart  of  sampling  operations 118 

IV-  1  0-2.  Calendar  of  sampling  activities 119 


Other  exhibits  for  reference 

1-2-17.      Map  location,    area,    urban  and  rural  popula- 
tion,   and  rural  population  density,    for 
Districts:     1960 123 

1-2-21.      Number  of  agricultural  holdings  by  size  of 

holding,    by  Provinces:     1961 125 

1-2-34.      Population  by  age  and  sex,    urban  and  rural: 

1960 126 

1-2-35.      Number  of  persons  in  the  household,    urban 

and  rural:     1960 127 

II— 2  —  1 .       Calendar  of  major  census  activities 128 

II-3-4.       Example  of  a  subdivided  sample  PSU 129 

II-3-6.        District  map  showing  sample  PSU's  and 

location  of  National  Certainty  holdings 130 

III-2-3.      1971  Census  of  Agriculture  Questionnaire 

(Northern  Territory),    Form  A-lb 131 

VI-2-4.      PSU  control  record  for  map  preparation, 

Form  A-271 135 


Digitized  by  the  Internet  Archive 

in  2012  with  funding  from 

LYRASIS  Members  and  Sloan  Foundation 


http://archive.org/details/agrostancasestudOOunit 


Chapter  IV  -  1.    DECISIONS  AFFECTING  DESIGN  OF  THE  CENSUS 


1.    INTRODUCTION 

As  part  of  the  initial  planning  for  Agrostan's 
Census  of  Agriculture,  basic  data  needs  of  the 
country  were  determined  in  conversation  with  ma- 
jor users  of  agricultural  statistics.  From  this 
information,  key  decisions  were  made  (see  Unit  II 
of  the  case  study)  on  such  important  questions  as: 

What  statistics  should  be  collected  in  the 
census,  for  what  level  of  publication,  and 
with  what  reliability? 

Should  the  data  be  collected  by  a  complete 
census  or  a  sample  census? 

What  should  be  the  general  approach  in  de- 
signing the  sample,  once  the  decision  is  made 
to  take  a  sample  rather  than  a  complete  census? 

This  chapter  (chapter  IV-l)  outlines  the  con- 
siderations involved  in  making  the  decision  to  take 
a  particular  type  of  sample  census  of  agriculture 
and  gives  some  basic  principles  that  should  be 
followed  in  designing  and  selecting  the  sample. 
Later  chapters  provide  a  more  detailed  discussion 
of  the  sample  design  and  estimation  procedures. 

2.    T  YPES  AND  RELIABILIT  Y  OF  ST  A  TISTICS 
REQUIRED 

2.1    Types  of  statistics  required 

The  needs  for  and  uses  of  agricultural  sta- 
tistics were  discussed  earlier  in  the  case  study 
(see  chapter  1  of  Unit  I).  It  is  apparent  that 
different  users  require  many  different  types  of 
statistics,  and  that  there  is  no  single  most  im- 
portant item  to  be  obtained  in  an  agriculture 
census.  Furthermore,  the  statistics  may  concern 
different  segments  of  the  population.  Tenure  of 
holding,  size  of  holding,  land  utilization,  size 
of  household,  and  income  of  the  holder's  family 
are  typical  items  of  important  information  that 
apply  to  every  holding  in  Agrostan.  Just  as  im- 
portant to  the  economic  development  plans  of 


Agrostan  are  many  characteristics  which  can  be 
associated  with  only  a  subgroup  of  the  holdings  in 
the  country.  For  example,  only  the  larger  hold- 
ings may  report  use  of  mechanical  power,  produc- 
tion of  particular  types  of  livestock  or  crops, 
and  use  of  chemical  fertilizer.  Therefore,  the 
census  must  be  designed  so  that  data  on  a  great 
variety  of  characteristics,  distributed  different- 
ly among  the  agricultural  holdings,  can  be  col- 
lected at  an  acceptable  level  of  reliability.  The 
table  outlines  in  Unit  III  illustrate  the  differ- 
ent types  of  data  which  will  be  required  from  the 
Agriculture  Census  of  Agrostan. 

2.2  Level  of  publication 

The  administrators,  economic  planners,  and 
other  users  of  agricultural  statistics  also  dic- 
tate the  level  of  publication  for  the  census .  If 
the  data  requirements  of  the  country  are  confined 
to  the  National  level,  then  the  amount  of  effort 
required  to  collect,  process,  tabulate,  and  pub- 
lish statistics  for  the  nation  as  a  whole  will  be 
considerably  less  than  that  required  to  produce 
separate  statistics  for  districts  or  Provinces . 
The  effect  of  the  level  of  publication  on  the  re- 
quired size  of  sample  is  particularly  important. 
In  the  case  of  Agrostan,  it  was  determined  that 
more  detailed  cross -classifications  are  needed  at 
the  Province  and  National  levels  than  at  the  dis- 
trict level  (or  zone  level  in  the  Northern  Terri- 
tory). The  census  must  be  designed,  therefore, 
so  that  important  data  can  be  published  at  the 
district  (or  zone)  level  in  the  form  of  simple 
distributions  of  characteristics  and  more  detailed 
distributions  and  cross-classifications  can  be 
published  at  the  Province  and  National  levels . l 


Data  will  not  be  published  separately  for  very  small 
districts;  these  will  be  combined,  and  figures  published 
for  the  combined  area. 
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2.3   Level  of  reliability 

Another  initial  decision  affecting  sample 
size  and  design  concerns  the  minimum  level  of  re- 
liability that  can  be  tolerated  for  the  majority 
of  characteristics  on  which  data  are  to  be  Col- 
lected in  the  census.  The  major  users  of  the 
data  should  help  to  make  this  determination.  For 
example,  if  a  planning  economist  indicates  that 
he  would  make  the  same  decision  even  though  a 
statistic  were  10$  higher  or  lower  than  reported, 
then,  for  his  purposes,  it  is  not  necessary  to  in- 
cur the  expense  of  planning  a  census  with  errors 
smaller  than  10$ .  Most  users  of  statistics  do  not 
think  about  data  in  terms  of  minimum  level  of 
sampling  error,  however,  and  the  statisticians 
usually  must  take  the  initiative  in  this  decision. 

The  errors  in  estimates  from  a  multi-purpose 
sample  survey  will  differ  for  different  charac- 
teristics, depending  on  the  variability  of  the 
characteristics,  or — in  the  case  of  attributes — 
depending  on  the  proportion  of  units  having  the 
attribute.  An  item  which  does  not  vary  much  among 
different  holdings,  or  an  attribute  possessed  by 
a  large  proportion  of  all  holdings,  can  be  esti- 
mated with  greater  accuracy  than  other  items .  The 
desired  reliability  must,  therefore,  be  set  either 
for  a  very  important  class  of  items,  or  a  typical 
item,  with  the  understanding  that  the  accuracy  of 
other  items  will  vary  according  to  their  relation- 
ships to  the  basic  items .  These  expected  varia- 
tions must  also  be  checked  for  acceptability. 

Districts  in  the  Provinces  and  zones  in  the 
Northern  Territory  are  the  smallest  administrative 
divisions  for  which  separate  estimates  will  be 
published  from  the  sample  census  of  agriculture, 
and  data  users  will  expect  a  reasonable  level  of 
accuracy  for  such  estimates.  However,  the  dis- 
tricts making  up  the  13  Provinces  vary  in  size 
from  less  than  5,000  to  over  100,000  rural  popu- 
lation (according  to  the  1960  Census  of  Popula- 
tion) ,  and  the  three  zones  of  the  Northern  Terri- 
tory vary  from  an  estimated  74,000  to  106,000 
rural  population  (exhibit  1-2-17).  If  the  same 
sample  plan  is  used  in  all  districts  and  zones, 
therefore,  the  larger  administrative  divisions 


will  have  larger  samples  and  more  accurate  esti- 
mates than  the  smaller  ones .   In  order  to  have 
a  uniform  sampling  plan  and  a  relatively  easy 
method  of  analyzing  results,  it  was  decided 
after  considering  several  alternatives,  to  estab- 
lish a  desired  level  of  accuracy  for  the  average 
district  (or  combination  of  small  districts).   In 
the  Northern  Territory,  a  somewhat  different  mini- 
mum level  of  desired  accuracy  was  set  for  the  zone 
estimates,  with  the  condition  that  it  should  apply 
to  all  three  zones,  as  explained  below. 

2-31  Provinces .--In  the  Provinces,  the 
standard  of  reliability  for  district  estimates  in 
an  average -size  district  was  set  at  5$  coefficient 
of  variation  for  estimates  of  characteristics  asso- 
ciated with  half  of  all  holdings,  and  correspond- 
ingly larger  errors  would  be  allowed  for  smaller- 
than-average  districts,  and/ or  for  characteristics 

2 

associated  with  less  than  half  the  holdings. 
Characteristics  associated  with  more  than  half  of 
all  holdings  are  expected  to  have  smaller  errors; 
for  example,  an  estimate  of  a  characteristic  asso- 
ciated with  all  holdings,  such  as  their  average  size 
or  the  total  land  in  all  holdings,  might  have  a  co- 
efficient of  variation  of  about  2.5$  in  an  average - 
size  district.  Some  very  small  districts,  which 
would  have  very  large  errors,  are  to  be  combined  for 
publication  purposes  (see  chapter  2) . 

At  the  Province  level,  the  errors  discussed 
above  might  be  reduced  from  5$  and  2.5$  to  about 
2$  and  1$,  respectively.   Estimates  for  subgroups, 
such  as  tenure  classes,  would  have  somewhat  larg- 
er, but  acceptable,  errors  at  the  Province  level. 
Errors  for  estimates  of  the  same  characteristics 
for  the  13  Provinces  as  a  whole  would  be  about  a 
third  as  large  as  those  for  individual  Provinces . 

2.32  Northern  Territory. --In  the  Northern 
Territory,  each  of  the  zones  is  larger  than  an 
average  district  in  the  Provinces .  With  only 


The  coefficient  of  variation  (relative  standard  error) 
is  the  standard  error  divided  by  the  estimate  and  is  ex- 
pressed as  a  percentage.   In  order  to  obtain  a  95%  confi- 
dence level,  use  two  times  the  coefficient  of  variation. 

The  number  of  districts  in  a  Province  ranges  from  8  to 
20;  thus,  the  Province  samples  are  many  times  larger  than 
the  district  samples. 
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three  zones,  even  the  smallest  of  them  is  suffi- 
ciently important  to  have  a  specified  accuracy, 
rather  than  setting  a  desired  level  for  the  aver- 
age zone.  The  limit  of  error  for  the  smallest 
zone  was  set  at  a  slightly  higher  level  (lower 
accuracy)  than  the  standard  required  for  the 
average  district.  The  detail  requirements  (esti- 
mates for  subgroups)  were  also  much  more  limited 
for  the  Northern  Territory.  It  was  agreed  that 
sampling  errors  for  the  smallest  zone  may  he  as 
large  as  5%   or  10$  without  leading  to  wrong  de- 
cisions by  data  users.  The  larger  zones  will  then 
have  smaller  errors.  Because  of  differences  in 
data  requirements  and  field  conditions,  and  ex- 
pected difficulties  in  data  collection,  the  sample 
design  for  the  Northern  Territory  is  different 
from  that  for  the  Provinces.  It  also  illustrates 
another  application  of  the  principles  of  sampling 
adapted  to  these  differences  and  difficulties. 
The  sample  design  and  data-collection  plans  for 
the  Northern  Territory  are,  therefore,  discussed 
separately — see  chapter  5  of  Unit  IV  of  the  case 
study. 

3.    SAMPLE  OR  COMPLETE  CENSUS 

The  decision  on  the  types  of  data  required 
and  the  publication  level  and  reliability  level 
is  closely  related  to  the  decision  on  whether  to 
take  a  complete  census  (enumerate  every  holding) 
or  a  sample  census  (enumerate  a  sample  of  hold- 
ings), or  possibly  a  combination  of  the  two. 

3.1   Advantages  of  a  complete  census 

Some  of  the  advantages  of  taking  a  complete 
census  are  listed  below.  Each  implies  a  corres- 
ponding disadvantage  of  taking  a  sample  census. 

(1)  There  is  no  sampling  error  in  a  complete 
census.   (Nonsampling  errors,  however,  are 
inherent  in  a  complete  census  as  well  as  in 
a  sample  census;  refer  to  section  3.4.) 

(2)  Data  can  be  produced  for  relatively  small 
administrative  areas. 

(3)  The  data  may  be  more  readily  accepted  by 
some  users. 

(4)  Since  preparation  of  a  sampling  frame  (list 
of  holdings)  and  selection  of  sample  holdings 


would  not  be  required,  the  field  procedures 
and  the  procedures  for  analysis  of  the  tabu- 
lated results  are  generally  less  complex 
than  those  required  for  a  sample  census. 


3.2  Advantages  of  a  sample  census 

On  the  other  hand,  there  are  many  advantages 
in  taking  a  sample  census.  Some  of  the  more  im- 
portant advantages  include  the  following: 

(1)  In  most  situations,  the  costs  of  a  sample 
agriculture  census  in  terms  of  money,  mate- 
rials, and  personnel  are  considerably  less 
than  those  of  a  complete  census. 

(2)  The  data  can  be  made  available  to  the  users 
at  a  much  earlier  date  than  is  possible  in  a 
complete  census.  In  many  countries,  collec- 
ting the  information  and  processing  a  large 
number  of  questionnaires  (approximately 
1,000,000  for  Agrostan)  provides  a  serious 
obstacle  to  early  publication  of  the  census 
results.  In  comparison,  the  processing  and 
tabulation  of  questionnaires  for  a  sample 
(around  75,000  for  Agrostan)  can  be  comple- 
ted before  the  results  become  out-of-date. 

(3)  A  sample  census  requires  fewer  trained  per- 
sons as  enumerators  and  supervisors.  At  a 
rate  of  80  to  100  interviews  per  enumerator, 
a  sample  census  of  Agrostan  would  require 
only  800  to  900  enumerators  and  supervisors. 
A  complete  census  in  Agrostan,  at  a  rate  of 
200  interviews  per  enumerator  (allowing  for 
less  travel),  would  require  at  least  5,000 
enumerators  and  supervisors.  To  accomplish 
such  a  large  task,  it  may  be  necessary  to 
disrupt  other  government  activities  to  pro- 
vide the  required  personnel  and  other  re- 
sources; indeed,  the  task  may  be  impossible. 

(4)  It  is  possible  to  select  the  smaller  group 
of  sample  census  enumerators  more  carefully 
and  to  train  them  more  intensively,  with  the 
expected  result  that  nonsampling  errors  will 
be  smaller  in  a  sample  census  than  in  a  com- 
plete census . 


3.3  Decision  to  take  a  sample  census 

Every  one  of  the  advantages  of  taking  a  sam- 
ple census  applies  to  Agrostan's  situation.  At 
the  same  time,  all  of  the  advantages  of  taking  a 
complete  census  apply  to  some  extent.  The  Work- 
ing Group  and  the  sampling  technicians  of  the 
National  Statistical  Office  (NS0),  after  deter- 
mining the  type  and  reliability  of  data  that  were 
required  and  considering  the  advantages  and  dis- 
advantages of  a  complete  census  as  compared  with 
a  sample  census,  reached  the  decision  that  the 
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census  data  could  be  most  efficiently  collected 
through  the  following  plan: 

(1)  A  complete  enumeration  of  all  the  very  large 
holdings  and  those  with  specialized  agricul- 
tural operations . 

(2)  A  sample  of  the  small  and  the  medium-size 
holdings . 

This  decision  took  into  consideration  that 
(a)  funds  to  carry  out  a  complete  enumeration  of 
holdings  would  likely  not  he  available,  (b)  the 
advantages  of  a  sample  census  (lower  cost,  improved 
quality,  etc.)  seemed  much  greater  than  the  ad- 
vantages of  a  complete  census,  (c)  the  disadvan- 
tages of  taking  a  sample  census  (in  terms  of  sam- 
pling error,  added  complexities  associated  with 
sample  selection  and  processing)  could  be  con- 
trolled, and  (d)  the  most  important  agricultural 
operations  in  the  country,  accounting  for  a  very 
large  part  of  the  total  agricultural  production, 
could  be  identified  and  included  in  the  census 
with  certainty,  at  very  little  cost.  The  decision 
also  took  account  of  the  need  to  provide  a  frame 
for  conducting  intercensal  surveys  - 

3.4  Effects  of  nonsampling  errors 

Errors  associated  with  the  results  of  a  sam- 
ple census  are  of  two  general  types:   (a)  sampling 
error,  which  results  from  enumerating  a  sample 
rather  than  the  complete  population,  and  (b)  non- 
sampling  errors,  which  result  from  inaccurate  re- 
sponses; incompleteness  of  field  work;  errors  in 
editing,  coding,  and  processing;  and  other  errors 
or  failures  not  related  to  sampling. 

Any  response  problem  which  arises  in  a  com- 
plete census  will  most  likely  occur  also  in  a 
sample  census.  A  sample  census  cannot  be  expected 
to  eliminate  response  errors,  but  will  reduce  the 
number  of  problem  cases  which  must  be  examined, 
and  thus  may  require  the  expenditure  of  a  smaller 
amount  of  total  effort.  Furthermore,  the  use  of 
a  sample  will  permit  a  more  thorough  investiga- 
tion of  each  problem  case;  thus  nonsampling  errors 
can  be  better  controlled  in  a  sample  census. 

There  is  reason  to  believe  that  response 
errors  are  particularly  important  in  agriculture 
censuses.  For  Agrostan,  it  was  decided  to  place 


emphasis  on  controlling  nonsampling  errors  through 
great  care  in  selecting  enumerators  and  super- 
visors and  through  an  intensive  training  program. 

4.    BASIC  PRINCIPLES  IN  SAMPLE  DESIGN 

A  number  of  basic  principles  were  kept  in 
mind  in  designing  the  sample  census  for  Agrostan. 
These  principles  may  be  summarized  as  follows : 

(1)  Before  undertaking  any  census  (either  a  com- 
plete or  a  sample  census),  one  must  decide 
what  data  are  to  be  collected  and  the  popu- 
lation from  which  they  are  to  be  obtained. 
One  must  define  the  "universe." 

(2)  By  some  means,  a  "frame"  must  be  established 
from  which  a  sample  can  be  selected.  The 
frame  may  be  made  up  of  several  parts;  for 
example,  a  basic  frame  of  ordinary  holdings 
and  a  supplemental  frame  of  holdings  with 
specialized  agricultural  operations.  In  to- 
tal, the  frame  should  cover  the  whole  "popu- 
lation" being  studied. 

(3)  Every  holding  in  the  population  should  have 

a  chance  of  being  selected  in  the  sample;  and 
this  chance  of  being  selected  should  be  known. 

(4)  The  sample  design  should  be  as  simple  and 
straightforward  as  possible  so  that  it  can 
be  carried  out  as  planned. 

(5)  It  must  also  be  possible  to  carry  out  the 
sampling  plan  within  the  time  limits  and 
financial  limits  which  have  been  established 
administratively;  if  the  plan  fails  in  these 
respects,  the  results  may  not  be  available 
at  the  time  when  they  are  needed,  or  indeed 
the  census  may  never  be  finished.  Data  may 
be  collected,  but  there  may  be  no  money  for 
tabulation,  or  for  analysis  of  the  results. 

(6)  Unbiased  procedures  should  be  used  in  plan- 
ning and  carrying  out  the  census. 

(7)  It  should  be  possible  to  estimate  the  re- 
liability of  the  estimates  from  the  sample 
data  themselves. 

(8)  In  choosing  from  the  alternative  sample  pro- 
posals which  would  lead  to  a  suitable  sample 
design,  one  should  use  a  method  which  yields 
the  maximum  amount  of  information  for  the 
amount  of  money  expended. 

(9)  By  means  of  adequate  supervision  and  control, 
one  must  actually  carry  out  the  survey  in 
accordance  with  the  specifications  which 
have  been  prescribed. 

Principles  6,  7,  and  8  lead  to  the  selection 
of  a  probability  sample . 


Chapter  IV  -  2.    SAMPLE  DESIGN  AND  ENUMERATION  PLAN  FOR  THE  PROVINCES 


This  chapter  describes  the  sample  design  and 
enumeration  plan  for  the  13  Provinces,  which  con- 
tain about  95%  of  the  total  population  of  Agrostan. 
Methods  of  selecting  the  sample  units  are  discussed 
in  greater  detail  in  chapters  3  and  4.   Plans  for 
the  Northern  Territory,  which  make  up  the  remainder 
of  the  country,  are  described  in  chapter  5. 

1 .    USE  OF  RESO  UR  CES  A  VAIL  ABLE  FROM 
PRE  VIO  US  CENS  USES 

In  designing  the  sample  for  the  1971  Sample 
Census  of  Agriculture,  the  sampling  technician 
will  make  use  of  the  experience  gained  in  the 
1961  Sample  Census  of  Agriculture .  Materials 
from  the  1970  Population  Census  also  will  be  used. 
One  of  the  first  steps  in  planning  the  sample  will 
be  to  collect  available  data  and  resources  for  use 
in  the  development  of  an  efficient  sampling  plan. 

1.1   Description  of  the  1961  Sample 
Census  of  Agriculture 

The  1961  Sample  Census  of  Agriculture  was  the 
first  agricultural  census  taken  in  Agrostan.  Be- 
cause of  a  shortage  of  funds  and  of  persons  experi- 
enced in  agricultural  data  collection,  it  was 
limited  in  scope  and  objectives .  The  Northern 
Territory  was  excluded  entirely,  and  the  census 
in  the  Provinces  was  a  sample  rather  than  a  com- 
plete enumeration.  During  the  enumeration,  it  was 
discovered  that  the  Census  program  had  been  under- 
budgeted  and  inadequately  planned,  so  that  the 
quality  of  some  of  the  information  did  not  meet 
the  expectations  of  the  planners.   In  fact,  re- 
sults for  several  items  were  not  published  because 
it  was  believed  that  the  concepts  had  not  been 
interpreted  correctly.  Nevertheless,  the  remaining 
data  gave  rough  estimates  of  production  and  inven- 
tory figures  for  a  number  of  important  agricultural 
items,  and  the  experience  gained  will  be  useful  in 
planning  the  second  census  of  agriculture . 


1.11  Level  of  publication  and  reliability. — 
Data  were  collected  on  the  agricultural  structure 
in  each  of  the  13  Provinces  of  Agrostan  for  the 
purpose  of  planning  for  economic  development. 

The  Census  obtained  basic  data  on  tenure,  kinds 
of  crops  grown,  area  of  crops  harvested,  production 
for  six  of  the  more  important  crops,  land  utili- 
zation, livestock  numbers,  expenditures,  and  in- 
debtedness. Cross -tabulations  of  the  data  were 
made  for  a  few  characteristics .  National  esti- 
mates were  prepared  in  slightly  greater  detail 
than  estimates  for  individual  Provinces . 

No  reliability  specifications  were  set  before 
conducting  the  census .  Estimates  were  published 
for  the  nation  as  a  whole  (excluding  the  Northern 
Territory)  and  for  individual  Provinces,  but  not 
for  the  districts . 

1.12  Sample  design. — There  were  no  trained 
sampling  technicians  in  the  National  Statistical 
Office  for  the  1961  Census .  The  decision  to  con- 
duct interviews  for  a  sample  of  holdings  was 
based  primarily  on  the  fact  that  the  available 
funds  for  the  agriculture  census  were  insufficient 
for  a  complete  enumeration.  The  sample  design 
was  very  simple .  Every  20   rural  enumeration 
area  (EA)1  from  the  1960  Population  Census  was 
selected  for  the  sample --a  total  of  approximately 
550  EA's.  All  holdings  within  the  sample  EA's 
were  enumerated  for  the  Agriculture  Questionnaire . 
This  basic  sample  was  supplemented  with  approxi- 
mately 3,500  large  holdings  (having  500  hectares 
or  more),  which  were  included  in  the  sample  with 
certainty.  The  total  sample  size  was  over  40,000 


In  Agrostan,  an  enumeration  area  is  an  administrative 
area  for  purposes  of  taking  the  population  census.   In 
urban  areas,  one  EA  was  assigned  to  several  enumerators 
so  that  the  census  could  "be  completed  in  1  or  2  days;  in 
rural  areas,  an  EA.  was  assigned  to  one  enumerator  to  com- 
plete in  1  or  2  weeks.  Any  problem  in  the  definition  of 
EA' s  for  the  1960  Population  Census  was  a  potential  prob- 
lem for  the  1961  Census  of  Agriculture. 
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holdings  (an  average  of  70  to  75  holdings  per  EA) . 

1.13  Enumeration  plan. —  The  1961  enumeration 
relied  largely  on  the  Ministry  of  Agriculture 
personnel.  In  most  cases,  sample  EA's  were  as- 
signed to  the  Agricultural  Agent  in  whose  area 
they  were  located.  After  a  brief  training,  the 
agent  was  instructed  to  visit  his  sample  EA  (or 
EA's)  and  conduct  interviews  at  every  holding. 
Upon  completion,  the  questionnaires  were  delivered 
to  the  Province  Office  of  the  Ministry  of  Agri- 
culture and  later  transmitted  to  the  National 
Statistical  Office .  There  was  very  little  checking 
of  the  enumerator ' s  work  in  the  field . 

1.2  Use  of  the  1961  agricultural  data 

One  of  the  published  tables  from  the  1961 
Census  shows  the  number  of  holdings  in  various 
size  categories  (see  exhibit  1-2-21,  which  is  in- 
cluded as  a  reference  exhibit  in  Unit  IV) .  This 
distribution  of  holdings  by  size  will  be  useful 
in  determining  how  much  of  the  sample  for  the  1971 
Census  should  consist  of  large  holdings .   It  will 
also  help  in  defining  "large  holdings"  for  the 
purpose  of  the  1971  Agriculture  Census . 

In  addition  to  providing  data  for  guiding 
the  sample  selection,  results  of  the  1961  Census 
experience  led  to  the  following  judgments  con- 
cerning the  1971  Census : 

(1)  Objectives  should  be  more  clearly  defined. 

(2)  District  level  data  will  determine  the  relia- 
bility required  of  the  sample .  Data  for  Prov- 
inces will  then  be  sufficiently  accurate . 

(3)  A  subsample  of  holdings  within  the  sample 
EA's  should  be  used  to  obtain  a  more  efficient 
sample  than  could  be  obtained  by  complete  enu- 
meration of  all  holdings  in  the  EA. 

(4)  The  census  enumeration  should  be  carried  out 
by  persons  hired  for  the  census  and  directly 
responsible  to  the  NSO,  rather  than  by  persons 
primarily  responsible  for  non-census  activities. 

1.3  Use  of  the  1970  Population  Census 

The  chief  source  for  selecting  the  sample 
for  the  1971  Agriculture  Census  will  be  the  pre- 
liminary population  counts  and  the  maps  "from  the 
1970  Census  of  Population,  which  will  precede  the 


1971  Sample  Census  of  Agriculture.   The  1970 
Population  Census  EA's,  with  some  modifications, 
will  provide  an  up-to-date  sampling  frame  of 
small  census  divisions  covering  the  entire  area 
of  the  Provinces . 

Through  the  improved  mapping  program  (dis- 
cussed in  Unit  II  of  the  case  study) ,  maps  from 
the  1970  Population  Census  will  be  available  for 
the  agriculture  census;  it  is  expected  that  a 
number  of  these  maps  will  have  been  brought  up 
to  date  during  the  population  census .  The  Geog- 
raphy Division  staff  will  inspect  the  population 
census  maps  for  the  EA's  that  fall  in  the  agri- 
culture census  sample,  add  whatever  identification 
is  needed,  and  generally  improve  the  quality  of 
the  maps  when  it  is  feasible  to  do  so.   In  some 
instances,  new  maps  will  need  to  be  prepared  to 
replace  inadequate  and  lost  maps. 

It  is  hoped  that  some  of  the  field  super- 
visors and  enumerators  in  the  1970  Population 
Census  will  also  be  available  for  the  agriculture 
census.  Thus,  the  timing  of  the  1971  Agriculture 
Census  makes  possible  the  efficient  use  of  some 
important  resources  which  will  be  reflected  in 
the  sample  design  and  enumeration  plan. 

1.4   Preliminary  1970  data 

Since  final  population  counts  will  not  be 
available  in  time  to  plan  and  select  the  sample 
for  the  agriculture  census,  preliminary  counts 
will  be  sufficiently  accurate  for  planning  and  sam- 
ple selection.  The  preliminary  counts  of  EA's  will 
be  transcribed  to  a  worksheet,  which  can  be  con- 
veniently used  for  planning  the  sample  design. 
The  sampling  technician  for  the  1971  Agriculture 
Census  should  work  closely  with  the  staff  in  the 
Population  Division  of  the  NSO  to  insure  that  the 
necessary  counts  will  be  compiled  and  will  be 
available.  Exhibit  IV-2-1  gives  preliminary  1970 
population  counts  by  EA's  for  Ranchi  District  in 
Ra jpur  Province .  Exhibits  IV-2-2  to  IV-2-5  give 


The  1970  Census  of  Population  will  be  taken  as  of  1  May 
1970;  the  1971  Sample  Census  of  Agriculture  will  be  con- 
ducted in  November-December  1971  in  the  Northern  Territory 
and  in  January-February  1972  in  the  13  Provinces. 
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district  summaries  of  urban  and  rural  EA's  for 
Rajpur,  and  Province  summaries  for  all  Provinces. 

Additional  preliminary  data  useful  for  the 
overall  planning  of  the  sample  census  of  agricul- 
ture are  given  in  table  2A;  they  provide  a  rough 
idea  about  the  country.   (Text  tables  carry  the 
chapter  number  followed  by  a  letter.) 

Table  2A.   PRELIMINARY  DATA  FOR  PROVINCES:  1970 


Item 


Population,  total  Provinces 

Rural 

Percent  of  total 


Number  of  EA's,  total  Provinces 

Rural 

Percent  of  total 


Average  population  per  rural  EA. . . 

Average  number  of  housing  units 
per  rural  EA  (estimated) 

Average  number  of  holdings  of  1/10 
hectare  or  more  per  rural  EA 
(estimated) 


Approximate  number  of  districts 
(or  groups  of  districts)  for 
which  separate  estimates  are  to 
be  made* 

Average  number  of  rural  EA's  per 
district  having  separate  esti- 
mates   


Number 


9,374,552 
6,090,387 

65.0 

14,936 

11,826 

79.2 

515 

95 


S5 


150 


79 


*This  number  was  decided  after  inspection  of 
the  1970  Population  Census  results . 


2.    OUTLINE  OF  THE  1971  SAMPLE  DESIGN 

In  this  section,  it  is  assumed  that  several 
alternative  sample  designs  satisfying  the  basic 
principles  of  chapter  1  (section  4)  have  been 
considered  and  a  choice  has  been  made .  The  design 
agreed  upon  for  the  sample  in  the  Provinces  is 
described  in  the  remainder  of  this  chapter  (chapter 
2).  Chapters  3  and  4  describe  the  office  and  field 
operations  required  to  select  the  sample  in  the 
Provinces .  Chapter  5  discusses  the  sample  design 
and  selection  in  the  Northern  Territory. 

The  1971  sample  design  for  Agrostan  can  be 
described  generally  as  a  two-stage  probability 
sample,  although  some  holdings  are  selected  in  a 
single  stage .  Basically,  the  design  involves 


selecting  a  sample  of  primary  sampling  units 
(PSU's)  from  which  a  subsample  of  holdings  will 
be  selected.  A  number  of  features  have  been  in- 
troduced in  the  design,  however,  to  increase  its 
efficiency. 

2.1    Definition  of  the  universe 

The  universe  or  population  of  units  about 
which  agricultural  information  will  be  obtained 
consists  of  all  agricultural  holdings  with  two 
exceptions : 

(1)  Holdings  of  less  than  1/10  hectare  will  be 
excluded  unless  they  are  so  important  as  to 
be  on  the  list  of  "National  Certainty  hold- 
ings" described  below.  A  count  of  holdings 
of  less  than  1/10  hectare  will  be  obtained, 
however . 

(2)  Holdings  located  in  the  built-up  parts  of 
cities  of  over  5,000  persons  will  be  excluded 
unless  they  are  on  the  list  of  "National 
Certainty  holdings."  (There  is  a  possibility 
that  a  holding  located  in  the  built-up  part 
of  a  city  will  have  an  operator  who  lives 
outside  that  part  of  the  city.  Such  opera- 
tors and  their  holdings  are  included . ) 

These  exceptions  in  the  universe  definition  will 
greatly  reduce  the  cost  of  obtaining  the  infor- 
mation, while  having  only  a  very  slight  effect  on 
the  estimates  that  will  be  prepared.   It  is  be- 
lieved that  the  excluded  holdings  account  for  less 
than  2fo   of  the  total  agricultural  production  of 
Agrostan  in  1971 . 


2.2  Reliability  required 

Separate  estimates  are  required  for  each  of 
150  districts  or  combined  groups  of  districts 

3 

making  up  the  13  Provinces .   Estimates  for  the 
average-size  district  should  have  a  coefficient 
of  variation  of  about  5%   for  characteristics  asso- 
ciated with  half  the  holdings  and  a  coefficient  of 
variation  of  about  2.5%  for  characteristics  asso- 

4  4 

ciated  with  all  the  holdings .   Larger  sampling 


The  172  districts  in  the  Provinces  were  reduced  to  150 
by  combining  the  very  small  ones  (those  with  an  estimated 
total  number  of  holdings  under  2,500)  to  get  areas  suit- 
able for  making  separate  estimates. 

The  coefficient  of  variation  of  an  estimate  is  defined 
as  the  standard  error  of  the  estimate  divided  by  the  esti- 
mate.  Thus,  two  times  the  coefficient  variation  will  give 
the  relative  error  at  the  95%   confidence  level. 
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errors  will  be  acceptable  for  small  districts  and 
for  characteristics  associated  with  less  than  half 
the  holdings . 

As  stated  in  chapter  1  (section  2.3),  Province 
and  National  estimates  will  have  smaller  errors, 
well  within  an  acceptable  range,  if  district  esti- 
mates meet  the  specified  standards.  Hence  the 
requirements  at  the  district  level  are  the  deciding 
factor  in  the  sample  design. 

2.3  Total  sample  size  and  workloads 

The  sample  for  a  district  of  average  size  will 
include  about  400  small  holdings  selected  in  about 
20  primary  sampling  units  and  about  50  medium-size 
holdings  selected  with  certainty  from  the  same 
sample  PSU's.   It  will  also  include  all  the  large 
and/or  commercially  important  holdings  from  the 
national  list  of  such  holdings  (about  25  in  the 
average  district).  Thus  the  sample  for  an  average- 
size  district  will  consist  of  about  475  holdings . 
Approximately  72,000  holdings  will  be  in  the  sample 
for  all  districts . 

Estimates  of  the  sample  size  by  Provinces  and 
districts  are  needed  for  purposes  of  establishing 
a  field  organization  and  processing  operations 
far  in  advance  of  the  actual  sample  selection. 
These  estimates  would  be  prepared  by  the  sampling 
technician  as  soon  as  the  sample  design  is  deter- 
mined. Exhibit  17-2-6  gives  the  estimated  work- 
load for  each  of  the  Provinces,  and  exhibit  17-2-7 
gives  the  workload,  by  district,  for  Rajpur  Prov- 
ince. The  term  "workload"  includes  both  the  num- 
ber of  listings  (housing  units)  and  the  number  of 
interviews  for  the  agriculture  questionnaire. 
Note  that  the  workload  varies  considerably  from 
district  to  district  and  Province  to  Province . 

2.4  Sample  units,  sampling  rates,  and 

methods  of  selection 

The  designation  of  the  unit  of  sampling,  the 
sampling  rates,  and  the  method  of  selection  are 
necessarily  closely  related.  Consideration  must 
be  given  to  a  final  plan  which  is  practical  and 
possible  to  carry  out  with  accuracy. 


2.41  National  Certainty  holdings . — Holdings 
on  the  national  list  of  large  holdings  to  be  in- 
cluded in  the  sample  with  certainty,  regardless  of 
where  they  are  located,  are  called  National  Cer- 
tainty holdings.  They  include  about  3,600  known 
estates,  plantations,  and  other  large  holdings  of 
500  hectares  or  more.  They  include  also  some 
small  holdings  known  to  have  specialized  agricul- 
tural operations  of  great  commercial  importance 

to  the  agricultural  production  of  the  country.  All 
holdings  on  the  list  of  National  Certainty  hold- 
ings are  enumerated,  whether  or  not  they  are 
located  in  the  selected  PSU's  described  below. 

2.42  The  first-stage  unit  (PSU).— The  re- 
maining sample  holdings  will  be  selected  in  two 
stages.  For  this  purpose,  primary  sampling  units 
(PSU's)  will  consist  mostly  of  1970  Population 
Census  EA's.  Where  the  census  EA's  are  considered 
to  be  too  small  or  too  large,  they  will  be  com- 
bined or  subdivided  before  selection  to  form  PSU's 
of  more  nearly  equal  size.  The  PSU's  will  be 
stratified  (ordered)  by  whether  they  are  urban 

or  rural,  by  location  within  the  districts,  and 
possibly  by  other  factors  such  as  elevation  and 
type  of  agricultural  activity,  if  this  information 
can  be  obtained  from  local  sources . 

Sample  PSU's  will  then  be  selected  systemati- 
cally at  the  rate  of  1  in  4,  resulting  in  a  total 
of  about  3,000  sample  PSU's  for  all  districts. 
This  will  be  an  average  of  about  20  for  each  dis- 
trict requiring  separate  estimates . 

2.43  Local  Certainty  holdings . — All  holdings 
of  more  than  a  specified  size  which  are  located  in 
the  sample  PSU's  will  be  included  in  the  sample 
with  certainty  and  will  be  referred  to  as  Local 
Certainty  holdings .  The  size  limits  for  these 
holdings  will  vary  by  Province--50  hectares,  100 
hectares,  or  200  hectares.  Approximately  7,000 
Local  Certainty  holdings  are  expected  to  fall 

in  the  national  sample,  but  this  number  might  vary 
by  district  from  zero  to  100  or  more. 

2.44  The  second-stage  unit  (small  holding).— 
A  sample  of  1  in  4  of  all  holdings  between  1/10 
hectare  and  the  local  certainty  size  limit  will  be 
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selected.  This  will  result  in  about  20  such  hold- 
ings per  sample  PSU  and  about  400  for  an  average - 
size  district.  By  definition,  all  holdings  with 
less  than  1/10  hectare  are  excluded  from  the  uni- 
verse, but  the  number  of  these  small  holdings  will 
be  available  from  the  Listing  Book. 

2.5    Weights 

Three  weights  will  be  required  in  tabulating 
the  data.  National  Certainty  holdings  will  re- 
ceive a  weight  of  1,  Local  Certainty  holdings  will 
receive  a  weight  of  4,  and  all  other  sample  hold- 
ings will  receive  a  weight  of  16.  These  weights 
are  the  reciprocals  of  the  probabilities  of  se- 
lection of  the  holdings . 

3.    OUTLINE  OF  THE  ENUMERATION  PLAN 

It  is  planned  that  the  listing  and  enumera- 
tion should  be  completed  in  about  6  to  8  weeks. 
Normally,  one  supervisor  will  be  assigned  to  each 
district,  and  he  will  be  in  charge  of  about  five 
enumerators .  Thus  each  enumerator  will  have  an 
average  of  four  PSU's  in  which  he  will  be  required 
to  list  all  holdings  and  to  interview  the  assigned 
sample  holdings.  Since  the  supervisor  must  select 
the  sample  of  holdings  to  be  interviewed  (and  he 
must  do  this  between  the  listing  and  interviewing 
operations  for  the  PSU) ,  the  enumerator  will  not 
always  be  able  to  complete  both  operations  for  one 
PSU  before  starting  another  PSU. 

3.1  Listing 

During  the  listing  operation,  the  enumerator 
will  visit  each  housing  unit  in  the  entire  PSU 
and  will  make  a  list  of  holdings  operated  by  house- 
hold members .  The  holding  itself  need  not  be  lo- 
cated within  the  sample  PSU.   (See  chapter  4  for 
further  details  on  listing  and  sample  selection). 

3.2  Preparing  the  sampling  frame 

The  supervisor  will  identify  the  holdings 
which  are  National  Certainty  holdings  and  delete 
them  from  the  list  which  is  to  be  used  as  a  frame 
for  sampling  the  small  holdings.  Any  holdings  of 


under  1/10  hectare  which  are  identified  in  the 
listing  will  also  be  excluded  from  the  sampling 
frame  for  small  holdings .  Finally,  the  super- 
visor will  identify  the  holdings  satisfying  the 
criteria  for  Local  Certainty  holdings,  which  are 
similarly  omitted  from  the  sampling  frame,  since 
they  will  be  included  with  certainty. 

3.3  Sample  selection 

After  the  listing  is  completed  and  the  frame 
prepared,  the  supervisor  will  use  the  sampling 
instructions  for  the  PSU  to  select  the  sample  of 
small  holdings.  He  will  also  designate  the  Local 
Certainty  holdings,  and  will  assign  any  of  the 
National  Certainty  holdings  for  which  he  wishes 
the  enumerator  to  obtain  questionnaires . 

3.4  Interviewing 

Enumerators  will  conduct  interviews  at  all 
Local  Certainty  and  sample  small  holdings  in  the 
PSU,  and  they  will  visit  National  Certainty  hold- 
ings as  assigned.  Questionnaires  will  have  been 
mailed,  when  possible,  to  each  of  the  National 
Certainty  holdings .  The  enumerator  will  review 
the  questionnaire  for  completeness  or,  if  neces- 
sary, will  conduct  an  interview. 

3.5  Supervision 

The  supervisor  will  make  several  visits  to 
the  enumerator  to  check  on  the  completeness,  and, 
insofar  as  possible,  on  the  accuracy  of  the  work 
done.  Initially  he  may  assign  one  or  more  PSU's 
to  each  enumerator,  and  he  will  assign  additional 
PSU's  as  these  assignments  are  completed.  He  will 
be  responsible  for  collecting  all  completed  ques- 
tionnaires, Listing  Books,  and  other  materials. 

4.    REASONS  FOR  VARIOUS  FEATURES 
OF  THE  SAMPLE  DESIGN 

The  reasons  for  the  principal  features  of 
the  sample  design  are  nearly  all  economic — either 
the  reduction  of  costs  through  clustering  the 
sample,  or  the  reduction  of  total  required  sample 
size  by  methods  which  increase  the  efficiency  of 
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the  sample .  Many  of  the  latter  methods  depend  on 
the  availability  of  advance  information  about  the 
agricultural  population. 

4.1  Two- stage  sample 

The  sampling  of  the  small  holdings  in  two 
stages  greatly  reduces  travel  costs,  as  compared 
with  a  smaller  and  less  clustered  sample  of  hold- 
ings that  would  have  the  same  reliability.  Another 
important  reason  for  the  use  of  two  stages  is  the 
reduction  in  the  cost  of  preparing  a  frame  for 
the  sampling  of  small  holdings .  For  a  single- 
stage  sample  of  small  holdings,  it  would  be  neces- 
sary to  make  a  list  of  all  of  the  small  holdings 
in  all  PSU's  in  the  universe.  With  a  two-stage 
sample,  it  would  be  necessary  to  prepare  such  a 
list  only  for  a  sample  of  PSU's. 

4.2  PSU'S  of  nearly  equal  size 

With  a  known  population  of  each  EA,  it  was 
possible  to  sort  out  the  very  large  and  very 
small  EA's,  and  to  divide  the  large  ones  into 
more  than  one  piece  and  combine  the  small  ones. 
The  purpose  was  to  form  primary  sampling  units 
(PSU's)  of  more  nearly  equal  size  (number  of 
holdings)  than  the  EA's  in  order  to  reduce  the 
part  of  the  sampling  error  resulting  from  vari- 
ations in  the  size  of  PSU's.  As  described  in 
chapter  3  of  Unit  IV,  the  procedure  resulted  in 
PSU's  having  an  average  of  about  80  small  hold- 
ings, plus  a  few  medium-size  and  large  holdings. 

4.3  Stratification  of  PSU'S 

The  ordering  of  PSU's  by  agricultural  char- 
acteristics is  a  form  of  stratification  which 
makes  it  possible  to  select  a  more  efficient 
systematic  sample  of  PSU's.  For  example,  if  we 
list  consecutively  all  PSU's  in  which  cotton  is 
produced,  and  select  a  l-in-4  systematic  sample, 
we  will  obtain  a  l-in-4  sample  of  PSU's  in  which 
cotton  is  grown.  Without  the  ordering,  the  1-in- 
4  systematic  sample  might  produce  an  accurate 
selection  of  cotton-producing  PSU's,  or  due  to 
sampling  error  it  might  not.  Stratification  by 


a  meaningful  ordering  provides  some  control  on 
sampling  error. 

4A  National  Certainty  holdings 

Since  a  list  of  the  more  important  agri- 
cultural producers  in  Agrostan  can  be  prepared, 
it  is  not  necessary  to  risk  having  these  holdings 
over-  or  under-represented  in  the  sample.  Large 
holdings  and  other  commercially  important  hold- 
ings will  all  be  included  in  the  sample  with 
certainty  regardless  of  whether  they  are  in  the 
selected  PSU's.  For  estimates  of  the  total- 
production  of  crops  grown  exclusively  on  estates 
or  plantations,  such  as  rubber  and  tea,  the  in- 
clusion of  these  holdings  with  certainty  will 
result  in  the  complete  elimination  of  sampling 
error.  For  other  crops  grown  on  these  large 
holdings,  their  inclusion  will  substantially  re- 
duce sampling  error,  as  illustrated  in  section 

5.4  of  this  chapter. 

4.5  Local  Certainty  holdings 

A  large  number  of  holdings  of  intermediate 
size  cannot  be  identified  before  the  listing 
operation.  These  holdings  may  be  so  large  that 
their  combined  agricultural  production  is  an  im- 
portant share  of  the  national  or  district  total. 
Some  of  these  medium-sized  holdings  will  be  in  the 
sample  PSU's,  in  which  case  they  will  be  identified 
during  the  listing  operation  and  will  be  included 
in  the  sample  with  certainty.  A  further  reduction 
in  sampling  error  will  follow. 

4. 6  Stratification  of  holdings  within 

sample  PSU'S 

A  final  control  on  the  sampling  error  is 
provided  by  the  size -of -holding  classification 
which  is  obtained  during  the  listing  operation. 
As  will  be  seen  in  chapter  4,  the  sample  is  se- 
lected separately  from  each  of  four  size  cate- 
gories .  Thus  the  sample  will  include  the  correct 
proportion  of  holdings  from  each  size  group  and 
will  be  more  representative  of  all  the  holdings 
in  the  PSU  than  is  likely  if  the  sample  holdings 
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are  selected  without  stratification. 

5.    FIRST   STEPS   IN   CALCULATING    THE 
NUMBER  OF  SAMPLE  HOLDINGS  NEEDED 

An  indication  of  the  thinking  followed  in 
arriving  at  the  sample  size  for  the  average  dis- 
trict is  given  below.  This  discussion,  as  well 
as  the  remainder  of  chapter  2,  may  be  omitted  by- 
readers  who  are  not  interested  in  the  mathematical 
considerations  of  the  sample  design. 

5.1    Size  of  sample  required  to  estimate  propor- 
tions with  a  simple  random  sample 

Consider  an  attribute  which  is  possessed  by 
about  50%  of  the  holdings  in  any  given  district. 
An  example  might  be  the  attribute  of  having  any 
cattle.  The  size  of  sample  required  to  estimate 
the  percent  of  holdings  having  1  or  more  cattle 
with  a  desired  reliability  from  a  simple  random 
sample  can  be  approximately  determined  from  the 
simplified  formula  for  the  standard  error  of  a 
proportion : 


(2.1) 


p:srs 


=  V! 


PS 


where  P  is  the  proportion  of  all  holdings  having 
the  attribute  (.5  in  this  case),  Q  =  1  -  P,  and  p 
is  the  proportion  estimated  from  the  sample.   If 
we  wish  to  have  95%  confidence  that  the  sample 
will  estimate  P  within  a  10%  error  (that  is,  with- 
in the  range  .45  to  .55),   then  the  error  level 

.05  and  k  is  2  (multiple 

I   confidence) .  Therefore  we 
if 

set 


E  =   kCTp  =  10%  of    .5 
of  o     needed  for 


p:  srs 


.05 


.025 


We  find  that  a  simple  random  sample  of  4-00  cases 

is  needed  to  obtain  this  degree  of  reliability. 

We  have  used  the  simplified  formula  for  a 

up:srs 

instead  of  the  more  exact  one, 

(2.2)         a  ="\/N^n  (PQ  j  . 

P:srs   V  N-l  \n  / 


The  factor  M  ~  -i  has  been  dropped,  since  it  has  a 
value  slightly  less  than  1.  The  average  district 
in  Agrostan  contains  a  total  of  about  6,000  to 
7,000  holdings  (N)  and  the  simplified  formula  gives 
a  slight  overestimate  of  the  size  of  sample  needed 
to  satisfy  the  requirements . 

A  simple  random  sample  of  400  holdings, 
needed  to  estimate  a  P  of  .5  with  specified  accu- 
racy, will  estimate  larger  proportions  more  accu- 
rately and  smaller  ones  less  accurately. 

5.2  Effect  of  clustering  on  sample  size 

We  will  not  use  a  simple  random  sample  in 
the  design  for  Agrostan.  Because  of  cost  and 
travel  considerations,  the  sample  will  consist  of 
groups  of  holdings  which  are  not  so  widely  scat- 
tered as  a  simple  random  sample.  Therefore,  we 
will  need  a  larger  number  of  individual  holdings 
than  would  be  required  for  the  same  accuracy  with 
simple  random  sampling.  The  sample  of  small  hold- 
ings will  be  clustered  in  about  20  PSU's  per 
district  (one -fourth  of  all  the  PSU's  into  which 
the  district  will  be  divided).   On  the  average, 
there  will  also  be  about  20  small  holdings  selec- 
ted for  the  sample  within  each  of  the  selected 
PSU's  (one-fourth  of  all  those  in  the  PSU) .  This 
will  result  in  an  average  of  about  400  small  hold- 
ings per  district. 

The  effect  of  restricting  the  sample  to  se- 
lected PSU's  is  to  increase  the  variance  (squared 
standard  error)  of  an  estimated  proportion  com- 
pared with  that  of  a  simple  random  sample  of  the 
same  size.  The  amount  of  increase  depends  directly 
on  the  degree  of  homogeneity  (similarity)  among 
the  units  in  a  PSU  and  on  the  average  number  of 
holdings  selected  within  PSU's.  The  relation 
between  the  variance  of  a  proportion  estimated 
from  a  cluster  sample  (a  two-stage  sample)  and 
that  of  a  proportion  estimated  from  a  simple 
random  sample  is  given  by 


(2.3) 


p:cl 
'p:cl 


<J2 


p:srs  L 


1  +  6  (n  -  1) 


where 


CTS_n  =  variance  of  p  in  a  cluster  sample 


of  n  holdings 
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a      =  variance  of  p  for  a  simple  random 
sample  of  n  holdings . 


6  (pronounced  delta)  =  intraclass  corre- 
lation coefficient,  which  is  a 
measure  of  homogeneity  of  the 
holdings  within  the  PSU ' s . 

n  =  average  number  of  sample  holdings  per 
selected  PSU.   In  the  Agrostan 
design  this  number  will  be  about 
20. 

A  thorough  discussion  of  6  is  beyond  the  scope  of 

5 

this  unit.  However,  a  number  of  sampling  texts 
provide  good  explanations  of  intraclass  correlation. 
For  the  purpose  of  explaining  the  sample  design  for 
Agrostan,  we  can  say  that  6  is  a  measure  of  simi- 
larity of  the  units  within  the  PSU's.   It  nearly 
always  has  a  value  between  0  to  1,  although  small 
negative  values  are  possible.  The  value  of  6  de- 
pends directly  on  the  total  number  of  small  hold- 
ings per  PSU  (average  of  80  in  Agrostan);  it  is 
not  affected  by  the  value  of  n,  the  average  number 
selected  in  the  sample . 

Generally  6  will  be  large  for  small  PSU's 
where  the  holdings  are  likely  to  be  similar,  and 
it  will  be  smaller  for  larger  PSU's.  There  is  a 
different  6  for  every  characteristic  estimated  by 
the  sample.  Experience  has  shown  that,  for  PSU's 
of  80  holdings,  the  value  of  6  for  many  agricultur- 
al characteristics  can  be  expected  to  be  about  .05. 
This  value  will  be  used  in  the  following  discussion 

Substituting  6  =  .05  and  n  =  20  into  equation 
(2.3)  the  variance  of  p  becomes 


a2   .  =  a2 

p:cl       p:srs 


[l  +  .05(20  -  i)J 


p:srs 

which  is  approximately  2  a2 

1 r  p: srs 

Thus,  if  subsamples  of  20  holdings  are  se- 
lected from  sample  PSU's  having  about  80  holdings, 
the  variance  will  be  nearly  double  that  of  a 
single-stage  simple  random  selection  of  the  same 


3For  example:   Hansen,  Morris  H. ;  Hurwitz,  William  N.;  and 
Madow,  William  G.   Sample  Survey  Methods  and  Theory.  Volume 
I:  Methods  and  Applications.   New  York,  John  Wiley  and  Sons , 
1953.  Refer  to  chapter  6,  section  8  of  this  text. 


total  number  of  holdings .  We  found  above  that  a 
simple  random  sample  of  400  holdings  is  needed  for 
district  estimates  of  desired  accuracy.  There- 
fore, if  a  sample  of  400  holdings  is  clustered  as 
described,  it  would  have  a  variance  twice  as  large 
as  desired.  We  should,  therefore,  need  a  clustered 
sample  of  twice  as  many  holdings,  or  800  in  the 
average-size  district. 

To  summarize  the  effect  of  clustering,  we 
have  found  that  to  obtain  a  specified  standard  of 
reliability  twice  as  many  holdings  are  required 
if  the  sample  is  clustered  as  in  the  Agrostan 
sample  instead  of  being  a  simple  random  selection 
of  holdings .  This  conclusion  assumes  a  particular 
kind  of  clustering  (PSU's  of  N  =  80  and  n  =  20) 
and  a  typical  degree  of  homogeneity  (6  =  .05). 
It  is  not  limited  to  the  case  of  P  =  .5  used  in 
the  examples  although  the  results  400  and  800  are 
for  that  case .  For  characteristics  having  intra- 
class correlations  larger  than  .05,  the  cluster- 
ing will  lead  to  larger  sampling  errors,  and  the 
size  of  sample  needed  for  specified  accuracy  will 
need  to  be  increased. 

The  number  400  is  a  first  approximation  to 
the  size  of  sample  needed  for  specified  accuracy. 
It  is  a  very  rough  approximation,  since  it  does 
not  take  account  of  any  of  the  ways  in  which  the 
Agrostan  sample  departs  from  the  simplest  possible 
sample;  namely,  a  simple  random  sample.  Statis- 
ticians often  begin  to  think  of  the  size  of  sample 
needed  by  first  calculating  what  would  be  needed 
with  a  simple  random  sample .  This  result  is  then 
successively  modified  to  take  account  of  the 
special  features  in  the  actual  design,  some  of 
which  have  the  effect  of  improving  the  accuracy 
as  compared  with  a  simple  random  sample  of  the 
same  size,  while  other  features  (which  are  used 
to  reduce  costs)  have  the  opposite  effect  of  in- 
creasing sampling  errors,  or  requiring  a  larger 
sample  for  specified  accuracy. 

The  number  800  holdings  is  another  approxi- 
mation of  the  sample  size  needed.   It  takes  ac- 
count of  one  special  feature  of  the  design,  which 
can  be  described  by  saying  that  the  sample  of 
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holdings  was  clustered  in  selected  PSU's,  or  by 
saying  that  it  is  a  two-stage  sample  with  PSU's 
as  first-stage  -units  and  individual  holdings  as 
the  second-stage  units . 

The  specified  accuracy  level  used  here  was 
5%   coefficient  of  variation  for  estimates  of  an 
attribute  possessed  by  half  the  sample  holdings. 
This  is  somewhat  similar  to  the  standard  set  in 
section  2.2  as  the  objective  of  the  1971  Agricul- 
ture Census.  That  level  was  a  coefficient  of 
variation  of  5%   for  characteristics  associated 
with  half  of  all  holdings .  Assuming  that  half  of 
all  holdings  have  maize,  an  example  would  be  the 
average  hectares  of  maize.  Such  an  estimate  will 
be  subject  to  a  larger  sampling  error,  and  there- 
fore will  require  a  somewhat  larger  sample  than 
is  needed  to  estimate  the  percent  of  holdings 
having  maize. 

5.3   Size  of  sample  required  to  estimate 
averages  and  aggregates 

To  determine  the  size  of  sample  needed  to 
estimate  an  average  such  as  the  average  hectares 
of  maize  per  holding,  we  can  proceed  by  the  same 
reasoning  as  above,  but  use  the  formulas  appro- 
priate for  this  type  of  item.   Instead  of  formula 

6 

(2.1),  we  can  use  formula: 


(2.4) 

or,  in  relative  form, 


V  =  Vn= 


where 


V 


V  = 


-  relative  standard  error  (or 


coefficient  of  variation, 
the  mean 


of 


=  relative  standard  deviation  of 
the  item  in  the  population. 


This  formula  assumes  that  the  number  of  holdings  (n) 
can  "be  specified  in  advance  and  known  exactly.   In  the 
actual  application  in  Agrostan,  there  is  a  small  sampling 
variability  in  n,  which  arises  partly  from  the  fact  that 
the  PSU's  do  not  have  exactly  equal  numbers  of  holdings, 
and  partly  from  the  fact  that  the  numbers  of  holdings  in 
PSU's  is  usually  not  exactly  divisible  by  4  (the  sampling 
interval).   However,  for  the  present  purpose,  we  can  safely 
ignore  these  variations,  and  treat  x  as  an  average  rather 
than  as  a  ratio.   The  same  is  true  of  the  proportion,  p, 
discussed  in  section  5.1. 


n  -   size  of  sample  needed  to  estimate  X 

with  relative  standard  error  V 

x 

For  the  type  of  items  used  to  determine  the  stand- 
ard of  accuracy  needed  in  the  Agrostan  census 
(namely,  items  occurring  on  half  of  all  holdings), 

V  might  be  as  small  as  1  (100$)  or  as  large  as  4 
(a  standard  deviation  4  times  as  large  as  the 
mean) .   It  could  be  even  larger  for  items  with  a 
highly  skewed  distribution,  where  a  few  holdings 
have  extremely  large  values . 

Using  V_i  =  5%   as  the  desired  relative  stand- 

X 

ard  error,  we  find  that  when  V  =  100$,  this  gives 
n  =  400,  the  same  as  the  size  of  simple  random 
sample  needed  to  estimate  a  proportion  P  =  .5  with 
this  relative  error.  Many  items  included  in  the 
census  and  occurring  on  about  half  or  more  of  the 
holdings,  will  have  much  greater  variability  than 

V  =  100$.  They  would,  therefore,  require  simple 
random  samples  much  larger  than  400  (or  clustered 
samples  larger  than  300)  for  the  specified  accu- 
racy, if  no  modifications  were  made  to  improve  the 
efficiency  of  the  sample . 

Estimates  of  aggregates,  such  as  the  total 
hectares  of  maize  in  the  district,  will  require 
slightly  larger  samples  for  the  same  accuracy  as 
averages.  There  are  two  sources  of  sampling  error 
in  an  aggregate:   (a)  that  resulting  from  error  in 
the  estimated  average,  and  (b)  that  resulting  from 
error  in  the  estimated  total  number  of  holdings . 
With  the  Agrostan  sample  design  the  second  source 
is  almost  entirely  due  to  variability  in  the  num- 
ber of  holdings  between  PSU ' s .  To  the  extent  that 
we  succeed  in  controlling  the  variability  from  this 
source  by  equalizing  the  size  of  PSU's  (see  section 
3  of  chapter  3),  we  can  expect  that  the  number  of 
holdings  needed  to  make  estimates  of  aggregates  at 
a  specified  reliability  will  not  be  greatly  differ- 
ent from  that  needed  to  make  estimates  of  averages . 

We  conclude  that,  while  a  clustered  sample 
of  800  holdings  would  be  needed  to  estimate,  with 
specified  accuracy,  a  proportion  P  =  .5,  much 
larger  samples  would  be  needed  to  estimate  many  of 
the  aggregates  and  averages  for  the  various  crops, 
livestock,  and  other  items  in  the  census .  The  next 
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section  discusses  the  expected  effects  of  the 
special  features  in  the  Agrostan  sample  design 
which  have  the  opposite  effect  from  clustering, 
and  thus  reduce  the  size  of  sample  needed  for 
specified  accuracy. 

5  A  Effect  of  special  features  in  improving  the 
efficiency  of  the  sample 

As  described  in  section  4  above,  a  number  of 
special  features  were  used  in  the  sample  design 

7 

to  improve  its  efficiency  for  a  given  sample  size. 
Probably  the  most  important  of  these  is  the  in- 
clusion of  all  the  extremely  large  holdings  (Na- 
tional Certainty  holdings)  and  the  inclusion  of 
all  the  medium-size  holdings  which  are  found  in 
the  sample  PSU's  (Local  Certainty  holdings) .   In- 
clusion of  these  holdings  in  the  sample  has  a 
very  great  effect  in  reducing  the  errors  in  esti- 
mating averages  and,  consequently,  in  estimating 
aggregates .   In  the  case  of  National  Certainty 
holdings,  their  contribution  to  the  total  produc- 
tion is  measured  exactly,  and  the  variability  among 
the  holdings  remaining  to  be  sampled  (V  in  the 
formula  of  section  5.3) ,   may  be  reduced  to  half 
its  original  value.  To  make  estimates  for  the 
non-certainty  holdings  by  themselves,  therefore, 
the  required  sample  size  is  only  a  fourth  as  large 
as  would  be  needed  to  make  estimates  for  all  hold- 
ings with  a  uniform  sampling  rate  for  all.  To 
make  combined  estimates  for  both  types  of  holdings 
with  specified  relative  error,  an  even  smaller 
sample  size  will  be  adequate. 

To  see  the  possible  effect  of  National  Cer- 
tainty holdings,  consider  an  item  such  as  total 
production  of  a  specified  crop.  Assume  this  total 
for  the  district  is  1,000,000  metric  tons,  and 
that  the  distribution  is  highly  skewed  and  has  a 
coefficient  of  variation  V  =  4-00$.  A  sample  of 
400  holdings  would  estimate  the  mean  X  with  a  co- 


efficient of  variation  of  J±M2.  =   20%.  Assume  that, 

V400 
after  putting  50  very  large  producers  on  the  Na- 


The  use  of  a  two-stage  sample  (clustering)  is  also  for 
the  purpose  of  improving  efficiency,  tut  in  that  case  the 
efficiency  is  a  result  of  the  fact  that  a  larger  sample  can 
be  obtained  for  a  given  cost. 


tional  Certainty  List,  the  remaining  holdings  have 
a  total  of  750,000  metric  tons  and  a  V  of  200$. 
Then  a  sample  of  450,  made  up  of  all  the  National 
Certainty  holdings  and  400  other  holdings,  would 
have  no  error  in  estimating  the  250,000  metric 
tons  associated  with  the  National  Certainty  hold- 
ings .   It  would  have  a  relative  standard  error 

0C\Cff 

of  ,    =  10%   for  estimating  the  mean  for  the 

V400 
smaller  holdings  and  nearly  the  same  relative 

error  for  estimating  their  aggregate.  The  absolute 
error  (75,000  metric  tons)  for  the  non-certainty 
holdings  is  also  the  error  in  the  estimated  total 
for  all  holdings .  As  such,  it  is  a  relative  error 
of   qoq'qqq  =  7-5$.  Thus  the  20%  error  for  a 
sample  of  400  holdings  was  reduced  to  a  1 .%   error 
by  the  designation  of  50  National  Certainty  hold- 
ings .  The  Agrostan  design  is  somewhat  more  com- 
plex, with  Local  and  National  Certainty  holdings . 
The  Local  Certainty  holdings,  being  a  sample  of  a 
fourth  of  all  such  medium-size  holdings,  do  contri- 
bute some  error;  but  the  net  effect  of  both  types 
of  holdings  may  be  about  the  same  as  in  this  ex- 
ample, since  there  are  about  75  of  these  medium 
and  large  holdings  in  the  average  district  sample . 

We  have  already  taken  account  of  another 
feature  of  the  sample  design  which  greatly  reduces 
the  sampling  errors  of  aggregates;  that  is,  the 
approximate  equalization  of  the  size  of  PSU's  (in 
terms  of  number  of  holdings) . 

The  remaining  special  features — mostly  various 
forms  of  stratification — improve  the  estimates  of 
both  aggregates  and  averages  or  proportions . 

When  the  effects  of  all  these  special  features 
are  considered,  it  is  reasonable  to  expect  that 
the  Agrostan  sample  will  satisfy  the  required 
standard  of  accuracy.  That  sample  in  the  average 
district  has  about  475  holdings,  including  400 
small  holdings,  50  Local  Certainty  (medium-size) 
holdings,  and  25  National  Certainty  (very  large) 
holdings . 

5.5   Effect  of  a  constant  overall  sampling  rate 
on  district  sample  size 

To  control  the  number  of  weights   required  to 
tabulate  the  results,   the  sample  of  small  holdings 
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will  be  selected  at  the  same  overall  rate  (l  in  16) 
throughout  the  13  Provinces .  This  will  result  in 
an  expected  sample  as  small  as  150  holdings  in  the 
smaller  districts  (or  combined  districts)  and  as 
large  as  1,000  or  more  holdings  in  the  larger 
districts.  Therefore,  even  the  characteristics 
with  high  S's  or  those  which  can  be  associated 
with  only  a  small  proportion  of  the  holdings  can 
be  estimated  with  the  desired  reliability  in  the 
larger  districts.   In  contrast  to  this,  all  esti- 
mates for  small  districts  (with  150  to  300  sample 
holdings)  will  probably  have  coefficients  of  vari- 
ation larger  than  5fo  and  should  be  used  cautiously. 
A  coefficient  of  variation  greater  than  %   will 
normally  be  acceptable  for  characteristics  asso- 
ciated with  smaller  proportions  of  the  holdings 
as  well  as  for  all  characteristics  in  small  separ- 
ately published  districts,   (it  will  be  recalled 
that  the  smallest  districts  are  not  separately 
published,  but  are  combined  with  other  districts 
for  publication . ) 

5.6  Sample  size  for  cross-classifications 

The  warning  that  a  comparatively  larger  re- 
lative error  must  be  tolerated  for  characteristics 
belonging  to  small  proportions  of  the  population 
is  particularly  applicable  to  cross -classifications, 
Suppose,  for  example,  that  each  of  five  different 
size-of -holding  classes  are  subdivided  into  three 
type -of -holding  categories.  Estimates  of  charac- 
teristics for  these  subgroups  in  an  average  dis- 
trict would  be  based  on  a  sample  of  about  30  hold- 
ings rather  than  475.  Quite  obviously,  the  re- 
lative sampling  error  for  each  of  these  15  esti- 
mates would  be  much  higher  than  for  estimates 
based  on  the  entire  sample . 

In  setting  up  the  tabulation  plans,  there- 
fore, the  expected  number  of  sample  holdings  in 
various  subgroups  must  be  considered.  Generally 
speaking,  the  number  of  cells  in  a  table  should  be 
small  enough  so  that  most  cells  are  represented  by 
at  least  30  to  50  holdings.  The  1961  Census  re- 
sults can  be  used  as  guides  for  establishing  tables 
for  publication  which  meet  this  requirement. 


5.7   Summary 

In  summarizing  the  discussion  on  sample  size, 
it  must  be  emphasized  that  there  is  no  standard 
procedure  for  deciding  sample  size  for  sample 
censuses  where  a  large  number  of  estimates  are 
to  be  made .  Each  characteristic  is  distributed 
among  the  agricultural  holdings  differently.  Even 
if  the  standard  error  for  each  characteristic  were 
known,  it  would  not  be  possible  to  arrive  at  a 
single  sample  size  which  would  provide  all  esti- 
mates at  any  one  specified  level  of  reliability. 
However,  a  sample  of  475  holdings  has  been  deter- 
mined on  the  basis  of  experience  in  agricultural 
surveys  and  supported  by  the  above  discussion,  to 
be  sufficient  to  provide  estimates  of  most  of  the 
more  important  characteristics  such  that  the  co- 
efficient of  variation  will  be  no  greater  than 
about  5%   (10%  error  at  the  95%  confidence  level). 
It  is  assumed  that  a  greater  sampling  error  is 
acceptable  for  less  common  characteristics. 

The  value  assumed  for  the  intraclass  correla- 
tions and  for  the  relative  standard  deviation  in 
the  above  examples  are  believed  to  be  roughly 
accurate  for  many  agricultural  characteristics 
for  clusters  of  80  holdings .  However,  these  are 
only  intelligent  guesses  based  on  experience  in 
a  few  countries .   In  order  that  required  sample 
sizes  can  be  more  accurately  determined  for  the 
next  agriculture  census  and  for  intercensal 
surveys,  variance  calculations  for  a  few  charac- 
teristics should  be  a  part  of  the  processing  plan 
for  the  1971  Agriculture  Census  of  Agrostan. 

6.    DETERMININATION  OF  THE  NUMBER  OF 
FIRST-  AND  SECOND-STAGE  SAMPLE 
UNITS  NEEDED 

The  reader,  at  this  point,  has  probably  ac- 
cepted the  fact  that  a  sample  of  400  small  holdings 
selected  from  20  clusters  and  supplemented  by  the 
certainty  holdings  and  improved  by  the  other 
features  as  described,  will  provide  reliable  dis- 
trict estimates  in  the  sample  census .  He  may  be 
asking  himself,  "Why  not  select  a  smaller  but 
more  widely  distributed  sample  of  holdings?"  For 
example,  a  smaller  total  sample  size  would  be  re- 
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quired  for  the  same  level  of  reliability  if  the 
holdings  were  selected  from  a  larger  number  of 

_        2 

PSU's  giving  a  smaller  n  and  a   x  in  Equation 
(2.3).  The  larger  but  more  clustered  sample  can 
be  justified,  however,  when  costs  are  considered 
together  with  the  reliability  requirements . 

6.1    Types  of  census  costs 

Costs  of  the  census  can  be  broadly  grouped 
into:   (a)  overhead  costs,  (b)  costs  which  vary 
in  proportion  to  the  number  of  sample  PSU's,  and 
(c)  costs  which  vary  in  proportion  to  the  number 
of  sample  holdings . 

6.11  Overhead  costs . — This  category  includes 
costs  of  the  central  administrative  and  technical 
work  on  the  census .  Almost  all  of  the  overhead 
costs  are  present  whether  a  complete  census  or 
sample  is  used.  They  include  the  costs  of  the 
operation  of  the  National  Statistical  Office  and 
the  cost  of  operating  the  regional  agriculture 
census  offices  in  the  field .  For  the  purpose  of 
optimizing  the  sample  design  of  the  agriculture 
census,  the  overhead  costs  will  be  assumed  to  be 
constant  under  alternative  sample  designs . 

6.12  PSU  cost. --The  following  are  examples 
of  items  contributing  to  costs  which  vary  in  pro- 
portion to  the  number  of  PSU's: 

(1)  Preparing  sample  materials  (maps,  etc.)  for 
selected  PSU's. 

(2)  Travel  between  PSU's  and  locating  PSU's. 

(3)  Listing  the  PSU's. 

(A)     Selecting  the  sample  of  holdings. 
(5)  Supervisors'  salaries  (part). 

6.13  Sample  holding  costs . — The  following 
are  examples  of  costs  which  vary  in  proportion  to 
the  number  of  sample  holdings : 

(1)  Interviewing  the  sample  holdings. 

(2)  Printing  questionnaires  and  other  forms. 

(3)  Editing  and  coding. 

(4)  Tabulating. 

(5)  Supervisor's  salary  (part). 

(6)  Travel  within  the  sample  PSU's. 


6.2  Determining  the  cost  function 

In  practice,  it  is  not  an  easy  task  to  divide 
the  total  costs  into  the  three  categories,  but 
usually  the  different  types  of  costs  can  be  rough- 
ly determined  and  their  values  estimated.   If  the 
sample  census  were  a  continuing  survey  in  which 
information  on  costs  could  be  collected,  the  sample 
could  be  changed  to  a  more  nearly  optimum  design 
after  the  first  results  are  available.  For  a  one- 
time operation  such  as  a  census,  care  should  be 
taken  to  make  fairly  accurate  cost  assumptions . 

There  is  very  little  available  data  in  Agro- 
stan  on  which  to  base  assumptions  of  costs.  There- 
fore, intelligent  guesses  must  be  made  of  the 
average  cost  per  PSU  and  the  average  cost  per 
sample  holding  included  in  the  census.  Since  over- 
head costs  are  assumed  to  be  constant  whatever  the 
sample  design,  only  the  PSU  costs  and  the  sample 
holding  cost  (that  is,  the  variable  costs)  will 
affect  the  optimizing  of  the  design.  The  relation- 
ship between  these  costs  is  important. 

It  is  obvious  from  looking  at  the  sources  of 
costs  that  the  unit  cost  related  to  sample  PSU's 
is  much  greater  than  the  unit  cost  related  to 
sample  holdings .  Much  work  is  required  to  prepare 
the  maps  and  other  sampling  materials  for  the 
sample  PSU's.  Motor  vehicles  must  be  used  by 
supervisors  to  travel  between  PSU's  and  each  enu- 
merator will  require  some  means  of  transportation 
primarily  for  travel  between  PSU's.  Travel  costs 
between  PSU's,  including  the  cost  of  vehicle 
maintenance,  are  among  the  largest  costs  of  the 
census.  Also,  the  listing  of  the  sample  PSU's, 
which  requires  a  large  proportion  of  the  enumer- 
ators '  time,  and  the  selection  of  the  sample  hold- 
ings by  the  supervisor,  will  add  much  to  the  ex- 
penses per  sample  PSU.  These  and  almost  any  other 
costs  which  vary  in  proportion  to  the  number  of 
PSU ' s  are  normally  quite  large . 

The  total  cost  associated  with  the  sample 
holdings  has  some  large  components .   Interviewing, 
editing  and  coding,  tabulation,  and  printing  may 
all  be  very  expensive  operations .  When  the  total 
of  these  costs  is  divided  by  the  number  of  sample 
holdings,  the  resulting  unit  cost  will  probably 
be  much  smaller  than  that  for  PSU's. 
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Under  the  conditions  which  prevail  in  Agro- 
stan, the  ratio  of  the  unit  cost  for  PSU's  to  the 
■unit  cost  for  sample  holdings  can  be  expected  to 
be  roughly  about  20  to  1.  Assume  that  the  cost 


(C  )  of  including  a  PSU  is 


and  the  cost  (C  ) 
s 


of  including  one  sample  holding  in  the  census  is 
$2.  Then  the  optimum  design  contains  the  number 
of  sample  PSU's  (m)  and  sample  holdings  within 
PSU's  (n)  which  minimizes  the  total  variable  cost 
for  the  desired  level  of  reliability.  The  formula 
for  the  cost  function  is  given  below: 


(2.5) 


Cm+Cmn=C-C 

lb  o 


where:  C  =  unit  cost  per  PSU,  based  on  the  costs 
related  to  the  number  of  sample  PSU's 

C  =  unit  cost  per  holding,  based  on  the 
costs  related  to  the  number  of  sample 
holdings 

C  -  C  =  total  cost  minus  overhead  cost 
(total  variable  cost) 

m  =  number  of  PSU's  in  the  sample 

5  =  average  number  of  holdings  per  sample 
PSU. 


6.3  Determining  the  number  of  PSU'S  and  the 
average  size  of  subsample 

The  value  of  20  for  n,  the  average  size  of 
subsample  to  be  selected  within  a  sample  PSU,  was 
obtained  by  a  rough  method  of  optimizing  described 
in  exhibit  IV-2-8.   It  uses  trial  values  of  n, 
various  assumptions  consistent  with  those  devel- 
oped in  section  5  above,  and  unit  costs  as  in 
section  6.2.  Resulting  values  of  m  (the  number 
of  PSU's  needed)  and  of  C  -  C  ,  the  total  variable 
cost,  are  then  found.  We  then  choose  a  value  of 
n  which  is  close  to  the  optimum  and  is  a  convenient 
fraction  of  N  (the  total  number  of  holdings  in  the 
average  district) .  In  this  case  the  fraction  -£■, 
giving  n  =  20,  was  chosen.  The  value  of  m  was  al- 
so obtained  by  choosing  a  value  of  m  close  to  the 
optimum,  which  is  a  simple  fraction  of  the  number 
of  PSU's  in  the  average  district;  m  also  was  \  in 
this  case.  Thus  the  sample  in  the  average  district 
consists  of  about  20  PSU's  each  with  about  20  small 


holdings,  but  it  is  the  fraction  {\)   which  is  held 
constant — not  the  number  20.  This  assures  us  of 
having  a  probability  sample  of  all  holdings,  with 
the  following  probabilities  of  selection  and 
weights  for  the  three  categories  of  holdings : 


Probability 
of  selection   Weight 

i  v  1        1 

4  X  4  "  16     16 

i 

National  Certainty  holdings       1         1 


Category 

Small  holdings 

Local  Certainty  holdings 


7.    REMARKS  ON  SAMPLE  DESIGN 

The  planners  for  the  Agrostan  Census  of  Agri- 
culture are  fortunate  in  being  able  to  use  the  1970 
Population  Census  EA's  to  form  PSU's.  Otherwise 
the  cost  of  forming  clusters  would  have  been 
greater  and  a  different  PSU  size  (greater  or  small- 
er than  the  EA)  might  have  been  chosen.  With  a 
smaller  or  larger  PSU,  the  value  of  6  for  each 
characteristic  would  be  different.  For  example, 
if  smaller  clusters  than  80  holdings  were  used, 
the  values  of  6  would  generally  be  higher.  Larger 
clusters  usually  have  many  characteristics  with 
low  values  of  6.  This  is  true,  primarily,  because 
small  clusters  contain  holdings  which  tend  to  be 
similar.  Large  clusters  are  more  variable,  and 
are  usually  more  efficient  for  countries  like 
Agrostan  where  the  cost  of  including  a  PSU  in  the 
sample  is  very  high. 

With  some  general  knowledge  of  the  measures 
of  homogeneity,  it  is  possible  to  assume  a  typical 
value  for  6  which  is  reasonably  accurate  for  most 
of  the  important  characteristics  in  the  census . 
Assumptions  of  costs  can  be  based  on  experience  in 
agriculture  surveys,  and  an  approximately  opti- 
mum design  for  making  estimates  of  many  different 
types  of  characteristics  can  be  determined. 

If  work  on  the  sample  design  can  be  started 
early  enough,  it  would  be  highly  desirable  to 
base  cost  estimates  and  other  assumptions  on  re- 
sults of  a  pretest.  This  might  be  made  in  one 
district  selected  by  judgment  to  best  serve  the 
purpose . 


Chapter  IV  -  3.    SELECTING  THE  SAMPLE  OF  FIRST-STAGE  UNITS 


1.    INTRODUCTION 

With  the  sampling  plan  for  the  Provinces 
established,  work  can  begin  on  the  selection  of 
the  sample  units.  This  work  can  be  divided  into 
four  major  operations,  each  of  which  is  made  up 
of  smaller  operations.  The  four  major  operations 
are:   (a)  identifying  the  National  Certainty  (NC) 
holdings,  (b)  forming  a  frame  of  PSU's,  (c)  selec- 
ting the  sample  PSU's,  and  (d)  listing  all  hold- 
ings and  selecting  sample  holdings  within  the  sam- 
ple PSU's.  The  first  operation  is  partly  an  office 
and  partly  a  field  operation;  the  second  and  third 
operations  are  primarily  office  functions ;  these 
three  operations  will  be  described  in  this  chapter. 
The  last  operation  is  done  in  the  field  and  will 
be  described  in  the  next  chapter. 

2.    NA  TIONAL  CER  TAINT  Y  HOLDINGS 

An  example  of  the  importance  of  the  National 
Certainty  holdings  in  making  census  estimates  has 
been  given  in  section  5.4-  of  chapter  2.  Because  of 
their  importance,  the  identification  of  the  large 
agricultural  producers  for  the  National  Certainty 
list  will  be  given  high  priority  in  the  early  stages 
of  census  work.  From  a  sampling  point  of  view,  the 
National  Certainty  holdings  can  be  considered  as  a 
separate  universe  or  population  which  is  completely 
enumerated  in  the  census.  The  problems  are  (a)  to 
define  this  population  clearly,  (b)  to  construct  an 
accurate  list  of  the  units  it  contains,  and  (c)  to 
insure  that  no  holding  will  have  a  chance  of  being 
in  both  the  National  Certainty  list  and  the  list 
from  which  the  sample  holdings  are  selected.  Exhi- 
bits IV -3-1  and  IV-3-2  list  the  large  holdings  in 
Ranchi  District  in  Ra jpur  Province . 

2.1    Definition 

National  Certainty  holdings  will  be  defined 
as  (a)  all  known  agricultural  holdings  with  more 


than  500  hectares  of  land,  and  (b)  all  known  ag- 
ricultural holdings  of  any  size  on  which  important 
special  crops  are  produced  or  which  are  important 
in  the  livestock  or  poultry  production  of  the 
country.  Holdings  of  less  than  500  hectares  which 
would  qualify  as  National  Certainty  holdings  in- 
clude nurseries  and  greenhouses  producing  stock 
for  sale,  holdings  producing  silk  fiber,  holdings 
with  5,000  or  more  poultry,  dairy  holdings  with 
100  or  more  milk  animals,  feedlots  with  500  or 
more  animals,  and  all  government-owned  experimen- 
tal holdings.  Based  upon  the  1961  Census  results, 
about  3,600  holdings  that  are  large  or  have  spe- 
cialized agricultural  operations  will  be  listed. 
After  the  list  of  National  Certainty  holdings  is 
prepared,  deletions  and  additions  generally 
would  not  be  permitted.   (See  section  2.4  below 
for  exceptions . ) 

Some  districts  will  have  many  large  holdings, 
of  which  only  the  very  large  holdings  would  be 
included  as  National  Certainty  holdings.  Other 
districts  will  have  no  very  large  holdings  but 
smaller  holdings  which  will  be  included  because  of 
their  specialized  agricultural  activities.  Esti- 
mates made  for  the  nation  as  a  whole  would  be 
quite  reliable  with  a  much  shorter  list  of  large 
holdings.  Reliability  of  the  estimates  for  a 
district  will,  however,  depend  greatly  on  the 
National  Certainty  holdings  in  the  district. 

2.2  Sources  of  data 

Most  of  the  major  contributors  to  the  agri- 
cultural activities  of  a  district  will  be  obvious 
to  the  person  preparing  the  list  of  National  Cer- 
tainty holdings.  The  large  estates  and  plantations 
are  usually  well  known  throughout  the  district  in 
which  they  are  located.  In  addition,  a  systematic 
search  should  be  made  of  as  many  sources  as  possi- 
ble in  order  to  find  the  other  large,  but  less 
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well-known,  agricultural  producers.  Some  of  the 
major  sources  of  information  are  given  below. 

2 . 21  List  of  1961  large  holdings. — Holdings 
of  500   hectares  or  more  enumerated  in  the  1961 
Sample  Census  of  Agriculture  will  "be  listed  by 
the  NSO  staff  and  provided  to  the  District  Agri- 
cultural Agents  \Nfho  will  compile  the  1971  lists. 
The  1961  list  will  provide  the  starting  point, 
(see  exhibit  IV-3-1)  and  the  Agricultural  Agents 
are  required  to  account  for  each  1961  holding  and 
add  all  other  holdings  which  meet  the  requirements 
for  the  1971  Census.  The  1971  lists  are  compiled 
several  months  before  the  start  of  the  field  enu- 
meration. 

2.22  The  Ministry  of  Agriculture. — Officials 
of  the  Ministry  of  Agriculture  have  frequent  con- 
tacts with  operators  of  large  holdings.  A  few 
Government  officials  specializing  in  the  main 
commercial  crops  are,  therefore,  likely  to  have 
lists  of  major  producers  and  their  locations . 

2.23  Local  off icials .— At  the  district  level, 
the  operators  of  large  holdings  work  closely  with 
local  officials  in  the  production  and  marketing 

of  their  crops.  Agricultural  Agents  and  other 
district  officials  will  know  most  of  the  important 
holdings  in  their  areas. 

2 . 24  Associations  and  cooperatives. — The 
growers  of  specialized  crops  such  as  coffee  and 
tea  will  probably  be  members  of  associations  and 
cooperatives.  Many  of  these  growers  will  already 
be  on  the  list  of  National  Certainty  (NC)  holdings 
which  was  obtained  from  other  sources;  however, 

by  comparing  the  association  and  cooperative  mem- 
berships with  these  other  sources,  a  few  addition- 
al large  holdings  may  be  added  to  the  list. 

2.3   Small  holdings  erroneously  included 

Because  of  inaccurate  information,  some  small 
holdings  which  are  not  very  important  to  the  cen- 
sus may  be  included  in  the  list  of  NC  holdings. 
The  cost  of  locating  and  conducting  interviews  at 
these  holdings  may  be  very  high  in  proportion  to 
their  importance,  but  there  will  be  no  bias  caused 
by  including  them.   Once  a  holding  is  placed  on 
the  list  of  NC  holdings  and  excluded  from  those 


listed  in  the  sample  PSU's  (in  the  Listing  Book), 
it  should  be  retained  and  treated  the  same  as  other 
NC  holdings.   If  such  holdings  are  reclassified 
after  selecting  the  sample  of  small  holdings  it 
is  very  difficult  to  avoid  a  bias  in  the  estimates 
from  the  sample . 

2.4   Large  holdings  found  after  1971  lists 
are  compiled 

After  the  District  Agricultural  Agents  com- 
pile the  1971  lists  of  NC  holdings  (see  exhibit 
17-3-2),  the  lists  are  sent  to  the  NSO  for  dis- 
tribution to  the  field  supervisors  for  the  census . 
As  one  of  the  preliminary  field  operations,  the 
census  supervisor  is  required  to  spot  the  NC 
holdings  on  his  map  of  the  district  so  that  he 
can  appropriately  assign  them  to  the  enumerators 
for  completion  of  the  Agriculture  Questionnaires . 
At  the  same  time,  the  supervisor  is  to  search  for 
very  large  holdings  of  1,000  hectares  or  more 
which  are  missing  from  the  district  list.  He 
makes  this  search  in  all  PSU's  in  the  district 
(not  only  in  sample  PSU's),  adds  such  holdings  to 
the  NC  list,  and  assigns  them  for  enumeration. 
They  will  thus  carry  a  weight  of  1  in  the  analysis 
of  census  data. 

If  an  enumerator  finds  a  large  holding  of 
500  hectares  or  more  in  a  sample  PSU  (for  example, 
a  holding  of  600  hectares  or  one  of  over  1,000 
hectares  which  was  missed  in  spite  of  a  search  by 
supervisors),  he  should  list  it  in  the  usual  way 
and  it  will  be  treated  as  a  Local  Certainty  hold- 
ing.  It  will  then  carry  a  weight  of  4,  since  we 
must  assume  that  only  a  fourth  of  all  such  hold- 
ings would  be  discovered  within  the  sample  PSU's. 
If  local  information  makes  it  seem  almost  impossi- 
ble that  there  are  other  such  holdings  in  non- 
s ample  PSU's  comparable  to  one  found  in  a  sample 
PSU,  it  might  in  exceptional  cases  be  safer  for 
the  supervisor  to  add  such  a  holding  to  the  National 
Certainty  list,  thus  giving  it  a  weight  of  1. 

The  procedure  described  above  was  adopted  for 
Agrostan  in  order  to  keep  the  sample  as  unbiased 
as  possible.   Other  criteria  could  be  used,  such 
as  a  cut-off  point  higher  or  lower  than  1,000  hect- 
ares for  the  supervisor's  search. 
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3.    SAMrLE  OF  PSU'S 

The  holdings  which  appear  on  the  list  of  the 
NC  holdings  will  be  included  in  the  sample  with 
certainty.  A  sample  will  be  selected  from  all 
other  holdings  in  the  country,  and  interviews  will 
be  conducted  only  at  these  sample  holdings  and  the 
NC  holdings.  To  control  the  amount  of  travel  be- 
tween interviews,  the  sample  holdings  will  be 
clustered  in  selected  primary  sampling  units 
(PSU's).  Thus,  the  other-than-NC  holdings  will 
be  selected  in  two  stages:  first,  a  sample  of 
PSU's,  then  a  subsample  of  holdings  within  the 
selected  PSU's. 

A  considerable  amount  of  preparatory  work 
must  precede  the  actual  selection  of  PSU's.  This 
will  require  the  use  of  any  available  information 
about  the  PSU's  and  of  materials  from  the  1970 
Population  Census .  The  preparatory  work  will  be 
done  in  the  following  steps : 

(1)  A  sampling  frame  is  prepared  by  forming 
PSU's  from  the  1970  Population  Census  Enu- 
meration Areas  (EA's). 

(2)  The  PSU's  are  arranged  in  order  (stratified) 
for  selection  of  the  sample . 

These  steps  must  be  performed  for  each  district 
in  the  13  Provinces . 

3.1    Forming  the  PSU'S 

PSU's  will  consist  of  1970  Population  Census 
EA's  or  of  units  formed  from  the  Population  Census 
EA's  using  the  EA  maps  and  preliminary  1970  popu- 
lation counts    (see  exhibits   IV-2-1  and  TV-3-6  for 
Ranchi  District  in  Rajpur  Province).     As  they  are 
formed,    PSU's  are  listed  on  the  PSU  Sample  Selec- 
tion Sheet   (exhibit  IV-3-8),    or  they  are  recorded 
on  cards    (see  section  3.14-  below).     The  limits  of 
the  1970  EA's  will  be  used  as  the  boundaries   of 
PSU's;  where  EA's  are  subdivided,    physical  bound- 
aries such  as  roads,   rivers,    and  other  natural 
boundaries  which  can  be  clearly  seen  and  defi- 
nitely identified  by  the  enumerator  should  be 
f ollowed .     Because  of  special  problems  which  ex- 
ist in  the  urban  areas  and  in  very  remote  rural 
areas,    definitions  of  PSU's  will  be  slightly  dif- 
ferent from  those  used  in  the  ordinary  rural  areas . 


Ultimately,    a  sampling  frame  of  PSU's  covering  the 
entire  universe  will  be  available . 

3.11     Rural  PSU's. --Most  of  the  1970  Popula- 
tion Census  EA's  which  are  located  in  the  rural 
areas  will  be  acceptable  as  PSU's  without  modifi- 
cation.    As  shown  in  section  1.4  of  chapter  2, 
these  EA's  on  the   average  contain  515  persons  and 
85  holdings     of  l/lO  hectare  or  more .     However, 
there  will  be  a  few  EA's  which  are  much  smaller  or 
much  larger  than  the  average  EA.     Modifications   of 
these  EA's  will  be  required,   therefore,   to  form 
PSU's  closer  to  the  average  size.     For  this  pur- 
pose,  we  use  population  data  and  assume  that  by 
approximately  equalizing  population  we  will 
approximately  equalize  the  number  of  holdings . 

For  equalizing  PSU's,    the  convenient  number 
of  500  persons    (rather  than  515)  was   chosen  as 
the  desired  average  size  of  PSU.     This  will  corre- 
spond to  about  83  holdings .      To  prevent  excessive 
variation  from  the  average  size  PSU,    a  minimum 
PSU  size  of  about  300  persons  and  a  maximum  PSU 
size  of  about  700  persons  will  be  established. 
Almost  all  of  the  modifications   of  the  EA's  will 
result  from  this  size  limitation.      Of  the  average 
of  83  holdings,   about  3  will  be  Local  or  National 
Certainty  holdings,    leaving  about  80  small  hold- 
ings  in  the  average  PSU. 

Exhibit  IV-2-2  shows  the  distribution  of 
rural  EA's  by  population  in  Rajpur  Province,   which 
is  a  typical  Province.     Most  of  the  EA's  in  this 
Province  already  fall  within  the  limits  required 
for  rural  PSU's.     We  will  need  to  combine  some  of 
those  which  are  very  small  and  divide  into  more 
than  one  piece  some  of  those  which  are  very  large . 
The  combinations  will  always  be  made  with  EA's 
sharing  a  common  boundary.     The  divisions  will  in 
some  cases  form  two  or  more  separate  PSU's.      In 


The  figure  85  is  a  rough  estimate  based  on  data  and 
guesses  from  several   sources.      The   1970  Population  Census 
did  not  obtain  information  on  the  number  of  holdings; 
therefore,    data  from  the   1960  Population  Census  and  the 
1961  Sample  Census  of  Agriculture  were  used.      Exhibits 
1-3-34  and  I -2-35  indicate  an  average  of  5.5  persons  per 
rural  housing  unit.      Unpublished  data  from  the   1961  Census 
indicate  that  the  ratio  of  housing  units  to  holdings  of 
1/10  hectare  or  more  was  about    .91   in  the  EA.' s  defined  as 
rural.      Thus,    (515  X   .9l)  t  5.5  =  85.      Exhibits   IV-2-6  and 
IV-2-7   provide  more   detailed   estimates   by  Provinces  and 
districts,    combining  the  rural  and  urban  portions   that 
would  be  in  the   sample. 
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other  cases,  two  PSU's  in  combination  will  be  di- 
vided into  a  total  of  three  parts.  The  prelimi- 
nary population  counts  for  EA's  in  Ranchi  district 
(in  Rajpur  Province)  are  shown  in  exhibit  IV-2-1. 

There  is  a  special  set  of  rules  for  forming 
PSU's  from  rural  EA's  where  travel  will  be  very 
difficult  and  the  cost  of  including  them  in  the 
sample  census  will  be  very  high.  The  objective 
here  is  to  make  PSU's  which  are  somewhat  larger 
in  terms  of  number  of  holdings,  and  thus  to  have 
fewer  of  this  type  of  PSU  in  the  universe  and  in 
the  sample.  The  average  size  of  these  less  acces- 
sible rural  PSU's  will  be  approximately  700  per- 
sons with  a  lower  limit  of  about  500  and  an  upper 
limit  of  about  900.  They  are  referred  to  as  "re- 
mote" in  the  urban/rural  classification  of  PSU's. 
Sometimes  the  EA's  can  be  divided  into  more  than 
one  piece  with  the  use  of  the  EA  map  (see  exhibit 
II-3-4-).   In  most  cases,  however,  some  field  work 
is  required  to  divide  the  EA.  This  field  work 
would  be  quite  costly  if  every  such  EA  map  were 
taken  to  the  field  for  division.  Therefore,  a 
method  has  been  devised  which  requires  a  large  EA 
to  be  divided  only  if  it  falls  into  the  sample . 

3.12  Measures  of  size. — Each  single  PSU 
will  be  assigned  a  measure  of  size  of  one.  Where 
an  EA  needs  to  be  divided  into  several  separate 
PSU's,  two  or  more  measures  of  size  will  be  as- 
signed.  If  an  EA  is  not  of  an  appropriate  size 
for  simple  division  or  combination,  or  if  the  map 
boundaries  do  not  permit  it,  variations  of  this 
procedure  will  be  used.  As  stated  above,  it  may 
be  necessary  to  put  two  EA's  together  and  assign 
three  measures  of  size  to  the  combined  PSU.  Such 
combined  PSU's  will  be  divided  only  if  selected 
in  the  sample.  The  measures  of  size  will  be 
cumulated  and  sampled  systematically  as  described 
in  section  3.3   below.  By  doing  the  additional 
field  work  of  dividing  the  PSU's  only  if  one  of 
these  combined  PSU's  falls  into  the  sample,  a  con- 
siderable savings  in  field  work  will  result. 

3.13  Urban  PSU's.— The  1970  Population  Census 
EA's  were  larger  in  the  urban  areas  than  in  rural 
areas  (see  exhibit  IV-2-3);  however,  the  percent- 
age of  households  associated  with  holdings  is  much 


smaller  in  urban  areas.  This  percentage  may  vary 
according  to  the  size  of  the  urban  place  and  the 
degree  of  urbanization  in  the  area  surrounding  the 
city  or  hamlet. 

In  examining  the  urban  places,  the  NS0  staff 
felt  that  a  high  proportion  of  households  in  the 
smaller  urban  places  would  have  some  agricultural 
operations  and  thus  qualify  as  holdings.  There- 
fore, they  decided  that  places  under  5,000  popula- 
tion (according  to  the  1970  Census  of  Population) 
would  be  treated  the  same  as  rural  EA's. 

Urban  places  with  a  population  of  5,000  or 
over  will  be  treated  differently.  The  built-up 
sections  of  such  cities  and  hamlets  will  be  ex- 
cluded from  the  universe  (as  explained  in  chapter 
1  of  Unit  II).   The  assumption  here  is  that  the 
cost  of  listing  and  enumerating  holdings  in  the 
city  will  be  much  too  great  when  compared  with  the 
contribution  of  these  central  urban  areas  to  the 
agricultural  statistics  of  the  country. 

In  order  to  exclude  the  built-up  sections  of 
large  urban  areas  and  to  form  PSU's  of  the  re- 
mainder of  urban  areas,  maps  of  the  cities  showing 
the  1970  Population  Census  EA's  will  be  required 
(see  exhibit  IV-3-5).  The  PSU's  can  be  formed  on 
the  map  by  combining  EA's,  or  parts  of  EA's,  to 
obtain  PSU's  of  about  80  holdings. 

EA's  in  the  larger  urban  places  contain  on 
the  average  about  1,000  persons  or  approximately 
200  households.   The  person  preparing  the  PSU's 
from  these  EA's  will  accept  the  entire  EA  as  a 
PSU  without  change  if  in  his  judgment  it  contains 
approximately  80  holdings .  As  in  the  case  of  the 
rural  and  small  urban  areas,  the-  1970  Population 
Census  EA's  should  in  most  cases  make  satisfactory 
PSU's;  but  when  supplemental  information  indicates 
that  an  urban  EA  has  more  or  less  agricultural 
activity  than  usual,  modifications  of  the  EA's  will 
be  necessary.  The  modification  should  then  be 
carried  out  in  the  same  manner  as  in  forming  PSU1s 
in  the  smaller  urban  places,  except  that  a  judgment 


2National  Certainty  holdings  which  are  located  in  built- 
up  areas  would,  however,  be  included  in  the  census. 
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on  number  of  holdings  will  be  used  instead  of  the 
minimum  and  maximum  population  limitation. 

3.14  PSU  card. --After  the  PSU's  are  formed 
and  listed,  the  sample  PSU's  can  be  selected.   In 
some  districts,  the  selection  can  be  made  directly 
from  a  listing  of  PSU's  entered  in  numerical  order 
on  the  PSU  Selection  Sheet,  Form  A-261  (see  ex- 
hibit IV-3-8) .   In  other  districts,  however, 
where  there  is  a  great  variety  of  agricultural 
activities,  important  gains  in  reliability  can 
be  obtained  by  further  stratifying  (ordering)  the 
PSU's  by  characteristics  related  to  agriculture, 
before  selection.   In  these  districts,  the  pre- 
paration of  PSU  cards  will  facilitate  the  strati- 
fication. A  PSU  card  will  be  required  for  each 
single  PSU  and  one  for  each  pair  or  combination 
of  PSU's,  showing  the  measure  of  size. 

Not  much  data  will  be  available  from  the  1970 
Population  Census  which  can  be  transcribed  to  the 
PSU  card;  however,  through  the  use  of  maps  and 
general  knowledge  of  the  district,  some  informa- 
tion about  each  PSU  in  the  district  may  be  avail- 
able .  Ihe  items  of  information  which  can  probably 
be  obtained  for  each  PSU  appear  on  the  PSU  card 
shown  as  exhibit  TV-3-4.  They  include  the  fol- 
lowing: 

Items  (a)  through  (g). --These  items  identify  the 
PSU.  The  Sample  PSU  Numbers  are  a  set  of 
consecutive  numbers  for  the  selected  PSU's, 
numbered  in  the  order  drawn.  The  Card  Num- 
bers are  simply  a  set  of  consecutive  numbers 
assigned  to  the  PSU's  after  they  have  been 
stratified  (ordered) . 

Item  1,  Population. — Enter  the  preliminary  1970 
Population  Census  count. 

Item  2,  Measure  of  size. — Enter  the  measure  of 
size  for  the  PSU. 

Item  3,  Urban/rural. — Check  urban  or  rural  as  de- 
fined for  the  1970  Population  Census,  except 
to  designate  certain  rural  PSU's  as  "remote." 
Rural  PSU's  which  appear  on  a  topographical 
map  to  be  in  mountainous  areas  or  in  iso- 
lated, undeveloped  areas  will  be  classed  as 
"Remote . " 

Item  4,  Elevation. --The  average  elevation  in  broad 
class  intervals  can  also  be  taken  from  a 
topographical  map  of  the  district. 


Item  5,  Type  of  agriculture. --The  person  preparing 
the  PSU  cards  must  exercise  some  judgment 
based  on  a  knowledge  of  the  district  to  de- 
scribe type  of  agriculture .   If  the  PSU  is 
located  in  an  area  containing  crops  which  are 
different  from  those  grown  in  the  rest  of  the 
district,  he  will  enter  the  type  of  crops 
grown  in  the  PSU.  For  example,  cotton  is 
often  grown  near  rivers,  in  low-lying  areas; 
if  it  is  known  that  cotton  is  grown  in  the 
PSU's  located  along  the  river  in  a  particular 
district,  the  word  "cotton"  will  be  entered 
in  this  item  for  the  appropriate  PSU ' s . 

Item  6,  Remarks. — This  item  may  be  used  to  record 
any  unusual  or  unique  features  of  the  PSU. 


3.2  Stratification  and  listing  of  the  PSU's 

The  1970  Population  Census  EA's  have  been 
numbered  so  that  those  which  are  near  each  other 
are  consecutively  listed.  A  listing  of  resulting 
PSU's  on  Form  A-261  following  this  order  will 
provide  a  geographic  stratification,  which  will  be 
the  only  stratification  needed  for  most  of  the 
districts.  However,  as  mentioned  in  section  3.1 
above,  there  will  be  some  districts  in  which 
PSU's  are  not  similar  to  their  neighboring  PSU's, 
but  are  more  nearly  like  other  PSU's  in  the  dis- 
trict; these  are  the  districts  for  which  PSU  cards 
were  prepared.  The  final  stratification  in  these 
districts  will  result  in  reordering  the  PSU's  so 
that  they  can  be  listed  on  Form  A-261  with  similar 
PSU's  appearing  consecutively.  The  PSU  cards  will 
be  sorted  using  items  3,  4,  and  5  and  the  remarks 
item.  Item  1  (Total  population)  should  have  been 
made  approximately  equal  for  all  PSU's  when  the 
1970  Population  Census  EA's  were  modified  to  form 
PSU's.  Therefore,  the  first  step  in  the  stratifi- 
cation of  each  district  is  to  inspect  item  3  (the 
Urban/rural  code)  for  all  cards  in  a  district,  and 
to  group  together  cards  for  urban  PSU's,  followed 
by  rural  PSU's,  then  remote  PSU's.  Then  the  cards 
are  sorted  by  item  4  (Elevation)  within  each  urban/ 
rural  group.  After  this  is  completed,  the  strati- 
fication might  require  further  refinement  using 
item  5  (Type  of  agriculture),  and  item  6  (Remarks;. 

The  information  in  item  5  will  be  incomplete 
since  there  will  be  no  entry  for  most  PSU's.  How- 
ever, items  5   and  6  may  show  that  there  are  some 
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specialized  crops  grown  in  various  parts  of  the 
district.  If  so,  the  cards  should  be  rearranged, 
placing  together  those  PSU's  in  which  specialized 
crops  are  grown  (without  regard  to  urbanization 
and  elevation) .  The  following  illustrates  strati- 
fication for  a  district  which  has  several  differ- 
ent types  of  agricultural  activities. 


Sub- 

EA  No. 

Urban/ 

Elevation 

Type  of 
agri- 

district 

rural 

culture 

01 

001 
002,003 

004 

005 
006,007 

008 

Urban 
Urban 
Urban 
Urban 
Urban 
Urban 

02 

001 

Rural 

Lowlands 

RLce 

002 

Rural 

Lowlands 

Rice 

003,007 

Rural 

Lowlands 

Rice 

006 

Rural 

Lowlands 

RLce 

03 

001 

Rural 

Lowlands 

Rice 

006 

Rural 

Lowlands 

Rice 

02 

004 

Rural 

Lowlands 

Rubber 

005,008 

Rural 

Lowlands 

Rubber 

03 

005 

Rural 

Lowlands 

Rubber 

007 

Rural 

Lowlands 

Rubber 

002 

Rural 

Lowlands 

Sugarcane 

003,004 

Rural 

Lowlands 

Sugarcane 

009 

Rural 

Lowlands 

Sugarcane 

010,011 

Rural 

Plateau 

Cattle 

04 

003 

Rural 

Plateau 

Cattle 

004 

Rural 

Plateau 

Cattle 

005 

Rural 

Plateau 

Cattle 

008,009 

Remote 

Plateau 

Sheep 

010 

Remote 

Plateau 

Sheep 

03 

008 

Remote 

Plateau 

Sheep 

04 

001 

Remote 

Highlands 

Sheep 

002 

Remote 

Highlands 

Goats 

006,007 

Remote 

Highlands 

Goats 

011 

Remote 

Highlands 

Goats 

012 

Remote 

Highlands 

Goats 

The  greatest  gain  in  the  stratification  pro- 
cess will  result  from  the  geographical  ordering 
of  PSU's  produced  by  the  numbering  system.  For 
this  reason  the  further  stratification  of  the 
PSU's  (and  the  preparation  of  PSU  cards)  should 
not  be  allowed  to  consume  a  large  proportion  of 
the  total  census  resources  or  of  the  professional 
time  of  NS0  statisticians.   It  should  be  noted 
that  the  rearrangement  of  the  PSU's  by  the  first 
one  or  two  items  will  produce  most  of  the  addi- 
tional gains  from  stratification.  If  accurate 


information  is  available  on  all  four  items,  addi- 
tional gains  will  result  from  refining  the  strat- 
ification by  these  items.  The  information  required 
for  this  deeper  stratification  will  be  available 
for  only  a  few  districts  in  Agrostan. 

3.3   Selecting  sample  PSU's 

After  the  PSU's  are  formed  and  ordered  into 
homogeneous  groups,  they  are  listed  along  with 
their  measures  of  size  on  Form  A-261;  then  the 
sample  PSU's  can  be  selected.  Several  different 
methods  could  be  employed  in  selecting  a  sample 
of  PSU's.   In  choosing  the  method  of  selection 
for  Agrostan,  reliability  and  convenience  were 
important  considerations .  The  adopted  method  of 
selection  produces  an  efficient  sample  for  dis- 
trict estimates;  also  the  selection  procedure  and 
estimating  methods  are  easy  to  apply. 

For  the  Agrostan  sample,  the  sampling  inter- 
val was  determined  by  the  reliability  requirements 
for  the  average  district.  In  section  6  of  chapter 
2,  it  was  explained  that  an  average  of  20  sample 
PSU's  from  the  average  total  of  80  PSU's  per  dis- 
trict is  the  desired  sample  size.  Thus,  the 
sampling  interval  needed  in  the  average  district 
for  selecting  the  PSU's  is  4.   In  order  to  keep 
the  sample  self -weighted,  this  same  interval  will 
be  used  in  all  districts.  It  will  produce  fewer 
than  20  PSU's  in  small  districts  and  more  than  20 
in  large  districts,  with  resulting  variations  in 
expected  sampling  errors. 

A  random  start  between  1  and  4  will  be  se- 
lected for  the  first  district.  Every  fourth  PSU 
in  this  and  the  remaining  districts  will  then  be 
selected.  When  the  last  PSU  in  the  first  district 
is  selected,  either  0,  1,  2,  or  3  PSU's  will  re- 
main in  that  district.  These  will  be  counted  in 
finding  the  first  sample  PSU  in  the  second  dis- 
trict. Thus,  if  two  PSU's  were  remaining  in  the 
first  district,  then  the  second  PSU  of  the  second 
district  (the  fourth  since  the  last  previous  se- 
lection) will  be  the  starting  point  in  that  dis- 
trict.  In  other  words,  the  selection  is  carried 
out  from  a  continuous  list  of  PSU's  and  not  sepa- 
rately within  each  district. 
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The  selection  of  PSU's  is   illustrated  for 
Ranchi  District  in  Rajpur  Province  in  exhibit 
IV-3-8   (Form  A-261).     After  the  PSU's  are  ordered 
as  described  above,   the  sample  PSU's  will  be  se- 
lected systematically  from  the  cumulated  measures 
of  size    (equal  probability  except  where  a  PSU  has 
a  measure  of  size  other  than  1).     Since  the  sam- 
pling interval  has  been  set  at  4,    a  random  start- 
ing number  from  1  to  4  is  selected  to  identify  the 
first  sample  PSU,   and  then  the  remaining  units 
are  identified  by  successively  adding  4.     For 
example,   if  the  random  start  is   3,   the  selected 
PSU's  will  be  those  corresponding  to  the  measures 
of  size  numbers   3,    7,   11,   15,   etc.3 

In  exhibit  IV-3-8,    the   "start  with"  number  is 
"3"   (assuming  a  carry-over  of   "1"  from  the  pre- 
vious district)   and  the    "take  every"  is   "4".   Thus, 
the  first  sample  PSU  is  EA  002  in  subdistrict  01. 
(Note  that  EA  001  has  2  measures  of  size.)     The 
second  sample  PSU  is  the  second  part  of  EA  005  in 
subdistrict  01   (since  EA  005  has  2  measures  of 
size).       The  third  sample  PSU  is  EA  006  in  subdis- 
trict 02;    the  fourth  is  EA  009  in  subdistrict  02; 
etc.     The  sample  PSU  No.    in  column  7  starts  with 
01  in  each  district. 


We  refer  to  the  method  as   sampling  with  equal  proba- 
bility,   since   in  the  cases  where  measures  of   size  are  not 
1,    the  PSU  finally  formed  after  subdivision  will  have  had 
the   same  probability  of   selection  as  all  others. 

4 

The  two  parts  are  numbered  by  a  random  process  as  first 
or  second  parts. 


This  method  of  selection  of  PSU's  permits  the 
use  of  a  simple  estimation  method.  The  Local 
Certainty  holdings  will  all  have  the  same  weight, 
namely  4.   If  the  PSU's  were  selected  with  unequal 
probabilities,  it  would  be  possible  to  keep  a 
constant  overall  probability  for  small  holdings, 
but  it  would  be  necessary  to  vary  the  weights 
which  must  be  applied  to  Local  Certainty  holdings 
in  different  PSU's.  This  would  make  the  tabu- 
lation of  the  census  results  much  more  difficult 
and  would  be  subject  to  errors  in  weighting. 

The  efficiency  of  the  systematic  selection 
of  PSU's  results  mainly  from  the  fact  that  it 
produces  nearly  the  same  sample  proportion — namely 
one  fourth — of  the  PSU's  from  each  of  the  homo- 
geneous groups  formed  in  the  stratification 
process . 

After  the  sample  PSU's  are  selected  they  are 
given  a  sample  PSU  number  in  column  7  of  Form 
A-261.  The  sample  PSU's  are  listed  on  PSU  Control 
Record  for  Map  Preparation,  Form  A-271  (see  ex- 
hibit VI-2-4,  which  is  included  in  this  unit  as 
a  reference  exhibit)  and  copies  are  sent  to  the 
Geography  Division  for  appropriate  action.  Ex- 
hibit II-3-6  shows  the  map  of  the  sample  PSU's 
for  Ranchi  district  in  Rajpur  Province . 

Form  A-252  (exhibit  IV-3-7)  is  a  District 
Summary  of  the  number  of  PSU's  and  National  Cer- 
tainty holdings  in  the  sample .   It  is  used  by  the 
Field  Division  to  inform  the  Province  Census 
Offices  (and  district  supervisors)  of  the  number 
of  sample  PSU's  in  each  district. 


Chapter  IV  -  4.    LISTING  WITHIN  THE  SAMPLE  PSU  AND  SELECTION  OF  THE  SAMPLE  HOLDINGS 


1.    INTRODUCTION 

The  major  reasons  for  selecting  the  sample 
holdings  in  two  stages  were  discussed  in  chapter  2 
of  Unit  IV.  The  first  reason  is  the  substantial 
saving  in  travel  costs  of  enumerating  a  sample  of 
holdings  selected  in  two  stages  as  compared  with  a 
one-stage  sample  of  individual  holdings  which  would 
provide  the  same  reliability. 

The  second  reason  for  selecting  the  sample  in 
two  stages  is  the  reduction  in  the  amount  of  work 
to  prepare  the  frame  of  holdings.  To  select  the 
sample  in  one  stage,  a  list  of  all  holdings  in  the 
country  would  be  required.  Such  a  list  is  not 
available,  and  preparing  one  for  the  entire  country 
would  be  not  only  expensive  but  also  impractical. 
It  is  practical,  however,  to  prepare  a  list  of 
holdings  which  are  located  in  a  sample  of  PSU's. 
Thus,  for  the  Agrostan  Sample  Census  of  Agriculture, 
a  sampling  frame  (list  of  holdings)  is  required  for 
each  sample  PSU  so  that  a  sample  of  holdings  can  be 
selected  from  each.   In  this  chapter,  the  steps 
involved  in  preparing  the  sampling  frame  for  each 
selected  PSU  and  for  selecting  the  sample  of  hold- 
ings will  be  discussed  and  illustrated. 

2.    LISTING 

The  preparation  of  the  sampling  frame  of  hold- 
ings for  a  PSU  begins  with  making  a  list  of  all  the 
housing  units  in  the  PSU.  This  is  done  in  the 
"listing"  operation  through  the  use  of  the  Listing 
Book  Form  A- 2a  (see  exhibit  17-4-1) .   One  line  of 
the  Listing  Book  must  be  filled  for  each  housing 
unit  in  the  PSU.   On  this  line,  certain  information 
is  recorded  for  each  housing  unit,  which  determines 
whether  or  not  the  housing  unit  identifies  an  agri- 
cultural holding;  that  is,  whether  any  member  of  the 
household  operates  an  agricultural  holding,  either 


on  his  own  account,  in  partnership,  as  a  hired 
manager,  or  as  a  chieftain. 

2.1   Relation  between  housing  units,  holders,  and 
holdings 

The  reasons  for  associating  a  holding  with  a 
housing  unit  and  with  a  particular  household  mem- 
ber (the  holder)  were  discussed  earlier  in  the  case 
study  (chapter  1  of  Unit  II  and  chapter  1  of  Unit 
III) .   One  of  the  most  important  reasons  is  that 
census  information  must  be  obtained  from  the  holder 
or  a  member  of  his  family,   where  the  holder  does 
not  live  on  his  holding,  it  is  easier  to  find  him 
by  way  of  his  housing  unit  than  by  way  of  his 
holding. 

Some  housing  units  will  not  be  associated  with 
holdings,  since  they  will  have  no  household  member 
who  operates  a  holding.  These  are  the  housing 
units  with  answers  of  "No"  in  both  column  4  and 
column  5  of  the  Listing  Book. 

A  few  households  may  have  two  or  more  members, 
each  of  whom  has  a  separate  holding.  The  Listing 
Book  provides  for  listing  the  second  holder  (and 
each  successive  holder)  on  separate  lines  (see  the 
instructions  at  the  top  of  column  6  of  the  listing 
page). 

There  may  also  be  cases  where  a  single  house- 
hold member  operates  more  than  one  holding.   If  so, 
the  separate  holdings  are  listed  on  separate  sub- 
lines beginning  with  column  7. 

With  this  listing  procedure,  there  may  be  a  few 
holdings  inside  the  PSU  which  are  not  included  in 
the  sample  frame  for  the  PSU,  because  the  holders 
live  outside  the  PSU.  They  would  be  included  in  the 
sampling  frame  for  the  PSU  in  which  they  live.  Thus 
they  will  have  their  proper  probability  of  inclu- 
sion in  the  sample. 
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The  final  frame  of  holdings  to  be  sampled  in 
the  PSU  consists  of  all  the  holdings  listed  in  the 
PSU,  except  for  those  defined  as  out-of -scope  for 
the  census,  as  described  in  section  2.3  below. 

Exhibit  IV-4-1  illustrates  a  Listing  Book  in 
which  each  housing  unit  in  PSU  19  of  Ranchi  District 
(Rajpur  Province)  is  listed  in  column  3,  and  each 
agricultural  holding  associated  with  the  housing 
unit  is  listed  in  column  7.  The  information  on 
size  of  the  holding  identifies  holdings  which  are 
out-of -scope  of  the  census;  it  is  used  also  for 
stratification  of  holdings  by  size. 

2.2   Definitions  of  holder  and  holding 

Many  of  the  problems  in  sampling  agricultural 
holdings  arise  from  the  difficulties  of  defining  a 
"holding"  in  such  a  way  that  every  piece  of  agri- 
cultural land  will  have  one  and  only  one  chance  of 
being  selected  in  a  sample.  This  requires  a  defini- 
tion that  will  (a)  associate  every  piece  of  agricul- 
tural land  with  one  and  only  one  holding,  and  (b) 
identify  that  holding  with  one  and  only  one  PSU. 

A  precise  definition  of  a  holding  is  given  in 
Unit  III  of  the  case  study  and  the  definition  is 
applied  in  questions  5  to  14  of  the  Agriculture 
Questionnaire,  Form  A-la.   Generally,  a  holding  is 
one  or  more  plots  of  land,  operated  as  a  single 
unit,  on  which  crops  or  livestock  are  produced. 
The  holding  may  be  operated  by  an  individual  on  his 
own  account;  it  may  be  operated  by  a  hired  manager, 
a  chieftain,  or  by  a  partnership.  There  is  thus  a 
one-to-one  relation  between  holdings  and  holders 
except  for  (a)  partnerships,  in  which  a  single 
holding  is  associated  with  more  than  one  holder, 
and  (b)  holders  who  operate  two  or  more  separate 
holdings. 

In  order  to  associate  each  holding  with  one 
and  only  one  PSU,  we  use  the  house  where  the  holder 
lives  to  decide  with  which  PSU  the  holding  should 
be  associated. 

In  the  case  of  partnerships,  if  the  partners 
live  in  the  same  household,  the  instructions  at 
the  top  of  column  6  of  the  Listing  Book  provide 
for  making  only  one  listing  in  column  7;  for  ex- 
ample, line  6  on  page  1  of  the  listing  pages  in 


exhibit  IV-4-1  lists  two  partners  for  one  holding. 
If  the  partners  live  in  separate  households,  the 
instructions  on  the  Agriculture  Questionnaire  it- 
self will  eliminate  the  holding  if  any  partner 
other  than  the  "principal  partner"  is  selected  for 
the  sample  (see  question  2  of  Form  A-la).  Thus, 
the  holding  is  associated  with  only  one  PSU — that 
in  which  the  principal  partner  lives . 

In  the  case  of  holders  who  operate  two  or  more 
holdings,  each  such  holding,  wherever  it  may  be 
located,  is  included  in  the  listing  for  the  PSU  in 
which  the  holder  lives.  For  example,  the  holder 
listed  on  line  9  of  page  1  of  the  listing  pages  in 
exhibit  IV-<4-l  has  two  holdings .  One  is  a  small 
holding  of  2  3/lO  hectares  which  he  operates  "on 
own  account"  and  which  includes  his  house  and 
garden;  the  other  is  a  78-hectare  holding  of  which 
he  is  the  hired  manager.  The  latter  holding  is 
located  outside  this  PSU  but  is  included  in  the 
sample  frame  for  this  PSU. 

We  have  thus  provided  a  way  to  associate  every 
piece  of  agricultural  land  with  one  and  only  one 
holding,  and  every  holding  with  one  and  only  one 
chance  to  be  included  in  the  sample.   Column  6  of 
the  listing  page  provides  a  list  of  all  holders, 
and  column  7  (sublines  that  are  checked)  provides 
a  list  of  all  holdings . 

2.3  Out-of-scope  holdings 

The  holdings  of  less  than  l/lO  hectare  are 
excluded  from  the  scope  of  the  Agriculture  Census 
unless  they  qualify  as  National  Certainty  (NC) 
holdings  because  of  specialized  agricultural  op- 
erations (see  section  2.1  in  chapter  2  of  Unit  IV). 
It  is  assumed  that  many  of  these  very  small  holdings 
have  only  vegetable  gardens  or  animals  for  home 
consumption.  The  Listing  Book  provides  for  listing 
and  identifying  these  holdings  (column  8) .  Thus, 
a  count  of  them  is  obtained  in  the  sample  PSU' s  and 
an  estimate  of  their  total  number  can  be  made. 

2.4  Listing  problems 

Listing  all  the  housing  units  in  a  relatively 
small  geographic  area  may  sound  like  an  easy  task. 
Actually  it  can  be  a  complex  undertaking.  The 
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enumerator  must  visit  each  housing  unit  in  the  PSU, 
determine  the  number  of  holdings  associated  with 
it,  and  complete  the  listing  for  each  holding. 

In  making  the  listing,  he  should  be  aware  of 
the  most  common  causes  of  duplications  and  omis- 
sions. 

To  avoid  duplications,  the  enumerator  must  be 
alert  for  cases  of  multiple  holders  of  single  hold- 
ings; partnerships  will  be  the  most  common  example 
of  this  in  the  Provinces.  The  enumerator  must  also 
be  able  to  determine  when  two  or  more  agricultural 
operations  comprise  more  than  one  holding  and  when 
they  all  belong  to  the  same  holding. 

Probably  the  most  common  cause  of  omissions 
in  the  listing  results  from  the  enumerator  missing 
isolated  housing  units  in  his  canvass  of  the  PSU. 
Another  important  cause  of  omissions  is  the  tend- 
ency of  an  enumerator  to  list  one  holding  at  a 
housing  unit  and  then  to  forget  to  ask  about  addi- 
tional household  members  who  have  holdings  which 
should  be  listed  separately. 

The  best  protection  against  duplication  and 
omissions  is  a  carefully  planned  listing  procedure. 

3.    PREPARING  THE  SAMPLING  FRAME 

The  enumerator,  in  filling  columns  8  to  12  of 
the  Listing  Book,  will  provide  the  information 
needed  for  identifying  the  small,  out-of -scope 
holdings,  and  for  stratifying  the  remaining  hold- 
ings into  four  size  groups.  Any  holding  which  has 
an  entry  in  column  9,  10,  11,  or  12  of  the  Listing 
Book  is  a  possible  unit  in  the  final  sampling  frame . 
However,  one  more  adjustment  must  be  made  by  the 
supervisor  before  the  sampling  frame  is  ready.  He 
must  determine  whether  the  listing  units  in  columns 
9  through  12  include  any  National  Certainty  hold- 
ings .  National  Certainty  holdings  have  already 
been  listed  in  another  sampling  frame  and  must  be 
eliminated  from  the  list  of  holdings  within  the 
sample  PSU.  The  supervisor,  just  before  he  selects 
the  sample  holdings  in  a  PSU,  will  use  a  wavy  line 
to  mark  out  the  entries  in  columns  8  through  12 
for  National  Certainty  holdings .  All  other  hold- 
ings with  an  entry  in  one  of  these  columns  will 


comprise  the  final  sampling  frame  for  the  PSU. 
(All  the  National  Certainty  holdings  will  be  as- 
signed to  enumerators  from  the  National  Certainty 
Listing,  Form  A-2c  (see  exhibit  TV-3-2). 

4.    SELECTING  THE  SAMPLE  HOLDINGS 

The  sample  holdings  will  be  selected  by  the 
supervisor  during  his  first  visit  to  the  enumerator 
after  the  listing  for  a  PSU  has  been  completed. 
It  might  seem  strange  that  after  completing  a 
complex  job  like  the  listing  operation,  the  enumer- 
ator is  not  permitted  to  select  the  sample  himself. 
There  are  two  major  reasons  for  requiring  the 
supervisor  to  make  the  selection.   First,  survey 
experience  shows  that,  when  the  enumerator  makes 
the  selection,  he  is  likely  to  include  a  larger- 
than-expected  proportion  of  certain  types  of  units 
which  he  has  some  reason  for  preferring  to  enumer- 
ate. This,  of  course,  results  in  a  biased  sample; 
the  procedure  will  be  avoided  in  Agrostan. 
Secondly,  it  is  advisable  to  have  some  control  over 
the  supervisor's  activities.   Requiring  him  to 
visit  the  enumerator  to  select  the  sample  is  some 
assurance  that  each  enumerator's  work  will  be 
given  at  least  a  minimum  review  before  the  sample 
is  selected.  After  his  review,  the  supervisor 
will  select  a  sample  from  the  PSU;  this  sample 
will  include  all  Local  Certainty  holdings  and  one- 
fourth  of  the  small  holdings. 

4.1    Local  Certainty  holdings 

The  definition  of  a  Local  Certainty  holding 
is  based  on  size  of  holding  and  will  vary  by 
Province.   In  Liberto,  Lopez,  Morgan,  and  Tali 
Provinces  all  holdings  in  sample  PSU' s  with  50 
hectares  or  more  and  which  are  not  National 
Certainty  holdings,  will  be  Local  Certainty  hold- 
ings. The  size  limitation  is  100  hectares  in 
Bolivar,  Dubois,  Hassam,  Lo-San-Tho,  and  Paris 
Provinces  and  is  200  hectares  in  Dewar,  Rajpur, 
Rama,  and  Valencia  Provinces. 

The  reason  why  the  definition  of  Local 
Certainty  holdings  varies  is  that  agriculture  is 
more  intensive  in  some  Provinces  than  in  others. 
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Thus,  in  Lopez  Province  where  the  soil  is  fertile 
and  the  rainfall  is  plentiful  a  holding  of  50 
hectares  contributes  heavily  to  agricultural 
production.   In  contrast,  a  holding  of  50  hectares 
is  not  especially  important  in  Rajpur  Province 
with  its  arid  climate  and  infertile  soil.  The 
objective  of  the  procedure  for  Local  Certainty 
holdings  is  to  identify  the  more  important  hold- 
ings which  are  not  on  the  National  Certainty  List 
and  to  sample  these  holdings  at  a  higher  rate  than 
the  small  holdings . 

Since  Ranchi  District  is  in  a  Province 
(Rajpur)  where  the  size  limitation  for  Local  Cer- 
tainty holdings  is  200  hectares,  all  of  the  hold- 
ings in  the  PSU  with  an  entry  in  column  12  (200  or 
more  hectares)  will  be  in  the  sample  with  certainty. 
In  general,  about  one -fourth  of  all  the  Local 
Certainty  holdings  will  be  in  the  sample  in  every 
district,  since  a  fourth  of  the  PSU' s  are  in  the 
sample . 

4.2   Small  holdings 

From  the  small  holdings  which  remain  in  the 
sampling  frame,  one-fourth  will  be  selected 
systematically  in  each  sample  PSU.  The  supervisor 
will  be  provided  with  sampling  instructions  which 
include  two  numbers,  one  to  identify  the  first 
sample  holding  in  the  smallest  size  category  (the 
"start  with"  number),  and  another  for  the  sampling 
interval  (the  "take  every"  number).  The  sample 
is  then  selected  systematically  from  each  size 
category  in  the  Listing  Book  for  the  PSU. 

In  the  Provinces  of  Agrostan,  the  "take  every" 
number  is  always  4  and  the  "start  with"  number  is 
always  a  number  between  1  and  4  inclusive.   Let 
us  suppose  that  the  "start  with"  was  4  so  that  we 
can  apply  the  sampling  instructions  to  exhibit 
IV-4-1.  Then  in  column  9  of  exhibit  IV-4-1,  the 
first  sample  holding  is  the  fourth  listed  holding. 
Then  the  8   holding,  the  12   holding,  etc.,  from 
column  9  are  selected  until  the  last  holding  listed 
in  column  9  (on  the  last  page  of  the  Listing  Book) 
has  been  counted.  The  remainder  of  the  interval 
is  carried  into  the  next  size  category  in  column 
10  (on  the  first  page  of  the  Listing  Book),  and 
finally  the  process  is  continued  through  column  11. 


(The  holdings  listed  in  column  12  will  all  be 
Local  Certainty  holdings  in  Rajpur  Province.) 

The  result  of  the  sampling  process  is  the 
1  in  4  stratified  sample  of  small  holdings  in  the 
PSU,  the  sample  cases  identified  by  the  circled 
entries  in  columns  9  to  11  on  the  listing  pages 
(see  exhibit  IV-4-1).  All  Local  Certainty  holdings 
will  also  be  circled  for  enumeration.   In  some 
Provinces,  the  sampling  may  end  with  column  9  or 
10,  depending  on  the  definition  of  Local  Certainty 
holdings  in  the  Province. 

The  enumerator  will  be  instructed  to  return  to 
each  of  the  sample  small  holdings  and  to  each  Local 
Certainty  holding  to  conduct  an  interview  to  obtain 
information  for  the  Agriculture  Questionnaire. 

4.3   PSU's  with  a  very  large  number  of  small 
holdings 

In  spite  of  the  steps  taken  to  approximately 
equalize  the  PSU's  in  terms  of  number  of  holdings, 
there  will  probably  be  some  PSU' s  that  vary  widely 
from  the  desired  number  of  about  80.   In  most 
cases  this  will  simply  result  in  a  variable  work- 
load of  listing  and  interviewing.  However,  in  the 
possible  case  that  the  number  of  eligible  small 
holdings  in  the  sampling  frame  is  more  than 
double  the  expected  number,  another  procedure  will 
be  used  to  avoid  having  the  interviewer  spend  an 
excessive  amount  of  time  in  one  PSU. 

In  such  a  case,  the  supervisor  will  select 
the  sample  of  small  holdings  at  a  rate  of  one- 
half  (or  at  the  least,  one-third)  of  the  usual 
rate.  After  preparing  the  sampling  frame  as 
described  above,  he  will  enter  the  counts  of  hold- 
ings on  the  cover  of  the  Listing  Book.  By  adding 
appropriate  columns  of  line  j  ("Eligible  for 
PSU  sample"),  he  will  find  the  total  number  of 
small  holdings  in  the  frame.   If  this  number  is 
over  160,  the  usual  1  in  4  subsampling  rate  would 
produce  a  sample  of  over  40  small  holdings. 
Instead,  he  will  use  a  rate  of  half  the  usual  rate 
or  1  in  8.   In  the  unlikely  event  that  the  number 
is  over  240,  he  will  use  one-third  of  the  usual 
rate  or  1  in  12.  He  should  always  record  such 
unusual  subsampling  rates  in  the  "Notes"  space  on 
the  Listing  Book  cover  (for  example,  "l/2  sample 
of  small  holdings"). 


Chapter  IV  -  5.    SAMPLING  IN  THE  NORTHERN  TERRITORY 


1.    INTRODUCTION 

The  sampling  procedures  which  will  be  used 
in  the  Northern  Territory  are  slightly  different 
from  those  used  in  the  Provinces .  These  proce- 
dures and  the  conditions  in  the  Territory  which 
require  them,  are  discussed  in  this  chapter. 

The  Northern  Territory  is  largely  undevel- 
oped and  consists  of  three  zones  (the  Atbara  River 
Basin,  the  Northwest,  and  the  SuMc  Mountains). 
The  population  of  the  Territory  is  basically  a 
subsistence-farming  population.   In  one  of  the 
zones,  the  Northwest,  there  are  a  few  nomadic 
tribes  which  raise  sheep,  goats,  and  cattle  for 
their  livelihood.  Most  of  the  population  lives 
in  tribal  chief doms,  although  the  tribal  struc- 
ture is  beginning  to  break  down. 

The  climate  is  essentially  arid.  Most  of  the 
field  crops  which  require  much  moisture  are  pro- 
duced along  the  Atbara  River,  where  some  irriga- 
tion is  practiced.  The  rest  of  the  Territory  pro- 
duces mostly  livestock  or  crops  which  will  grow 
under  relatively  dry  conditions .  Because  of  the 
higher  per-holding  cost  to  take  a  census  in  the 
Territory  and  because  it  was  felt  that  the  pre- 
dominantly subsistence -type  agriculture  had  little 
effect  on  the  total  production  of  Agrostan,  the 
Northern  Territory  was  not  covered  in  the  1961 
Sample  Census  of  Agriculture.  The  materials  used 
for  planning  the  1971  Sample  Census  of  Agriculture 
in  the  Territory  will  come  from  the  1970  Popula- 
tion Census  and  other  sources . 

2.    CENSUS  OBJECTIVES  AND  PROBLEMS 

Since  agricultural  development  planning  is 
of  a  more  general  nature  in  the  Territory,  the 
requirements  for  detailed  agricultural  data  are 
not  as  great  as  for  the  Provinces.  On  the  other 
hand,  some  special  problems  exist  in  the  Northern 
Territory  which  make  it  necessary  to  modify  the 
procedures  adopted  for  the  Provinces . 


2.1    Census  objectives 

In  terms  of  data  requirements,  fewer  items 
will  be  covered  and  fewer  estimates  by  subgroups 
will  be  published.  The  required  level  of  relia- 
bility of  the  census  estimates  will  also  be  lower 
in  the  Northern  Territory  than  in  the  Provinces . 

2.11  Estimates  for  zones . — In  the  Provinces, 
data  will  be  produced  for  each  district;  estimates 
by  district  will  be  combined  into  estimates  for 
individual  Provinces  and  for  the  13  Provinces  as  a 
whole.   In  the  Northern  Territory,  on  the  other 
hand,  it  was  determined  that  data  would  not  be 
required  by  district  or  any  other  small  adminis- 
trative area.   Instead,  data  would  be  required  at 
the  zone  level.  The  sampling  plan,  therefore, 
provides  that  estimates  can  be  made  for  each  of 
the  three  zones  and  for  the  Territory  as  a  whole . 
It  was  decided  that  resources  which  would  normally 
be  used  to  provide  estimates  for  smaller  areas 
could  better  be  spent  either  in  improving  data  for 
the  Provinces  or  in  improving  the  zone  estimates 
for  the  more  important  characteristics . 

2.12  Estimates  for  subgroups . — There  will  be 
fewer  subgroups  requiring  separate  estimates  in 
the  Territory  than  in  the  Provinces.  This  reduc- 
tion will  result  mainly  from  limiting  the  number 
of  cross -classifications  in  the  tabulation  plans. 
Therefore,  estimates  will  be  made  for  most  of  the 
broad  classes  of  data,  but  much  less  detailed  in- 
formation will  be  produced. 

2.13  Accuracy  of  the  estimates . — Since  the 
census  data  for  the  Northern  Territory  will  be  used 
to  make  broad  decisions,  the  accuracy  of  the  esti- 
mates for  each  of  the  zones  need  not  be  as  great  as 
for  the  average  district  in  the  Provinces .   It  is 
believed  an  error  level  of  10%  can  be  tolerated 


"Errors"  as  discussed  in  this  section  refer  to  the 
coefficient  of  variation  of  the  estimates.  They  should 
he  doubled  to  obtain  the  error  limits  for  the  95$  confi- 
dence level. 
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for  characteristics  associated  with  half  of  all 
the  holdings,  and  that  decisions  based  on  these 
estimates  will  still  be  valid.  It  was  decided, 
however,  that  this  error  level  should  apply  to 
such  characteristics  in  the  smallest  zone;  cor- 
responding errors  in  the  larger  zones  will  be 
smaller.  For  characteristics  associated  with 
fewer  than  half  of  all  holdings  in  a  zone,  a 
larger  error  in  the  estimate  will  be  tolerated. 
Characteristics  associated  with  more  than  half 
the  holdings  will  have  smaller  errors,  as  will 
estimates  for  the  Territory  as  a  whole. 

2.2   Census  problems 

In  addition  to  having  different  objectives 
in  the  Northern  Territory,  the  problems  of  census 
taking  are  much  greater  there  than  in  the  Provinces. 
These  problems  will  affect  the  procedures  which 
can  be  used  in  carrying  out  such  a  large  operation. 

2.21  Mapping. — The  lack  of  suitable  maps  for 
census  work  is  perhaps  the  greatest  problem  of  all. 
The  more  detailed  maps  required  for  census  work 
are  not  available,  although  there  are  large-scale 
maps  of  the  whole  Territory.  For  the  1970  Popu- 
lation Census,  enumerator  assignments  were  made  up 
largely  from  a  list  of  chiefdoms;  population  living 
on  land  not  associated  with  a  chiefdom  was  enumer- 
ated in  Special  EA's  (special  enumeration  areas). 
The  boundaries  of  chiefdoms  usually  were  not  accu- 
rately shown  on  maps  and  were  not  clearly  visible 
in  the  field.  It  is  believed  that  many  of  the 
Special  EA's  were  not  canvassed  as  thoroughly  as 
the  chiefdoms  and  a  large  proportion  of  the  popu- 
lation in  such  areas  might  have  been  missed.  The 
1970  Population  Census  enumerators  were  required 
to  make  a  rough  sketch  of  the  Special  EA's.  How- 
ever, there  were  no  enumerator  maps  prepared  for 
the  chiefdoms,  since  each  tribal  group  was  able  to 
identify  which  population  made  up  its  chiefdom. 

The  agricultural  census  enumerator's  assign- 
ment will  generally  consist  of  two  or  three  chief- 
doms or  Special  EA's.  An  effort  will  be  made  during 
the  census  preparations  to  compile  an  accurate  list 
of  all  the  chiefdoms  and  Special  EA's  from  the  1970 


Census  of  Population  results.  This  list  will  serve 
as  the  frame  of  primary  sampling  units  covering 
the  Territory.  After  the  sample  PSU's  are  selected, 
crude  maps  will  be  prepared  on  which  the  field 
supervisor  will  mark  the  name  and  approximate  lo- 
cation of  the  sample  chiefdom.  The  enumerator  will 
be  given  one  of  these  maps  to  help  him  find  the  se- 
lected chiefdoms.  For  the  Special  EA's  in  the 
sample,  he  will  be  given  a  copy  of  the  sketch  pre- 
pared by  the  1970  Population  Census  enumerator. 

2.22  Response  error. — The  responses  by  the 
holders  in  the  Northern  Territory  may  sometimes  be 
in  error.  Because  of  their  isolation,  they  do  not 
think  in  terms  of  area  or  volume  in  standard  units. 
Therefore,  the  enumerator  will  generally  not  be 
able  to  obtain  acceptable  information  from  the  re- 
spondent on  the  area  under  cultivation  or  the  a- 
mount  of  production.   Instead,  data  on  area  of  the 
smaller  holdings  will  be  obtained  by  measuring  a 
subsample  of  such  holdings,  and  production  data 
will  be  obtained  only  for  five  major  crops . 

It  is  expected  that  often  the  respondent  will 
be  highly  suspicious  of  the  interviewer,  thinking 
of  him  as  a  government  tax  collector  or  other  offi- 
cial seeking  individual  information  which  might  be 
used  to  the  respondent's  disadvantage.  Therefore, 
many  respondents  may  hesitate  to  give  information 
or  may  even  give  some  false  information. 

2.23  Polygamy . — The  practice  of  polygamy 
still  exists  in  the  Northern  Territory.  This 
presents  many  problems  in  listing  the  holdings 
since  the  wives  in  most  cases  would  be  considered 
separate  holders .  The  relationship  of  one  wife 
to  another  is  not  always  clear,  however,  and  the 
enumerator  may  have  difficulty  deciding  whether 
the  wives  have  separate  holdings .  Normally,  the 
wives  live  in  different  households;  occasionally, 
they  live  in  different  chiefdoms .  Those  who  live 
in  different  chiefdoms  will  not  present  many  prob- 
lems since  their  agricultural  activities  are  al- 
most always  independent  of  each  other.  When  the 
wives  live  in  the  same  chiefdom  and  have  indepen- 
dent holdings,  the  enumerator  must  make  sure  that 
each  holding  is  listed  separately. 
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Often  two  or  more  wives  who  live  in  the  same 
chiefdom  will  grow  their  crops  on  the  same  plot  of 
land,  with  each  contributing  part  of  the  labor, 
seed,  fertilizer,  etc.   It  may  be  difficult  to 
separate  the  activities  of  one  wife  from  another; 
and  in  a  sense  the  holding  is  a  partnership.  The 
holding  of  the  senior  (first -married)  wife  would 
be  listed  as  a  partnership  holding;  and  if  selec- 
ted for  the  sample,  an  Agriculture  Questionnaire 
would  be  completed.  The  holding  would  also  be 
listed  when  the  household  of  a  junior  wife  is 
listed;  but  if  selected  in  the  sample,  an  Agricul- 
ture Questionnaire  would  not  be  completed.   (See 
question  3  of  the  Agriculture  Questionnaire  for 
the  Northern  Territory,  exhibit  III-2-3.) 

One  of  the  problems  which  affects  the  agri- 
culture census  is  that  the  husband  cannot  easily 
be  associated  with  a  single  holding,  since  he 
generally  divides  his  time  among  the  several  house- 
holds. The  husband  usually  has  agricultural  oper- 
ations of  his  own,  such  as  cattle  or  other  live- 
stock, and  these  cannot  always  be  associated  with 
a  single  housing  unit  in  the  listing  process.   In 
Agrostan,  the  husband  will  be  included  in  the  sam- 
ple at  the  household  of  only  his  senior  wife.  His 
cattle,  other  livestock,  or  other  independent  agri- 
cultural activities  would  then  be  included  with 
those  of  the  wife  with  whom  he  is  listed. 

2.2*4  Between-PSU  travel . —The  chief doms  of 
the  Northern  Territory  are  not  usually  located 
along  main  roads  or  other  thoroughfares  which  can 
be  traveled  by  automobiles.  Normally,  the  inter- 
viewer must  ride  a  bicycle  or  walk  long  distances 
from  the  road  in  order  to  reach  his  assigned  chief- 
dom. Therefore,  only  two  or  three  sample  PSU's 
can  be  assigned  to  an  enumerator.  Many  chief doms 
will  be  located  in  isolated  spots  in  rugged  moun- 
tainous terrain,  or  in  other  fairly  inaccessible 
places.  Such  locations  will  make  supervision 
very  difficult. 

2.25  Cultivation  practices. — The  cultivation 
practices  of  the  holders  in  the  Northern  Territory 
will  also  cause  some  problems.  The  holdings  are 
normally  quite  small  in  size  and  irregular  in  shape. 


Since  most  holders  will  not  be  able  to  provide 
information  about  the  area  cultivated,  objective 
measurement  of  the  crop  area  will  be  carried  out.3 
Such  measurements  for  an  individual  holding  will 
probably  not  be  entirely  accurate,  especially  when 
the  holding  is  irregular  in  shape. 

As  in  the  Provinces,  there  is  the  practice 
of  interplanting,  where  two  or  more  crops  are 
grown  together.  For  example,  peas,  peanuts  or 
beans  are  often  grown  together  with  maize.   A 
second  crop  is  normally  grown  along  with  coffee. 
There  will  also  be  examples  of  succession  crops 
where  one  crop  is  produced  and  harvested  and  another 
is  planted  and  harvested  on  the  same  land  in  the 
same  year.   Such  cultivation  practices  require 
special  census  concepts,  definitions,  and  proce- 
dures (see  Unit  III  of  the  case  study). 

3.  SAMPLE  DESIGN 

The  sample  design  for  the  Northern  Territory 
will  take  account  of  the  special  objectives  and 
problems  of  this  area.   Included  will  be  a  descrip- 
tion of  the  sample  design  for  objective  measurement 
of  the  crop  area  for  a  subsample  of  holdings . 

3.1    Considerations  affecting  the  sample  design 

Several  considerations  affect  the  sample 
design.     Among  them  are  the  definition  and  size 
of  the  universe,    the  definition  and  size  of  PSU's, 
and  the  total  sample  size  needed. 

3.11     Definition  of  the  universe  and  esti- 
mated total  number  of  holdings .--The  universe,    or 
population  of  units  about  which  agricultural  in- 
formation will  be  obtained,    consists   of  all  agri- 
cultural holdings  with  one  exception:     those  lo- 
cated in  the  built-up  portion  of  the   capital  city 
of  Zanzi . 

As  in  the  Provinces,   the  preliminary  popu- 
lation counts  from  the  1970  Population  Census  pro- 
vide a  basis  for  estimating  the  number  of  holdings 


Objective  measurement  will  be  applied  only  to  the  small 
holdings.  It  is  assumed  that  operators  of  large  holdings 
will  be  able  to  respond  directly  to  the  questions  on  land 
area. 
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in  the  universe.  Tnese  counts  are  shown  in  exhib- 
its IV-5-1,  IV-5-2,  and  IV-5-3.  After  eliminating 
the  PSU's  and  the  estimated  population  in  the 
built-up  sections  of  the  capital  city  and  clas- 
sifying the  remaining  population  by  size  and  kind 
of  PSU  (urban  or  rural)  in  which  it  is  located, 
results  shown  in  the  first  three  columns  of  ex- 
hibit IV- 5 -4  are  obtained.  From  these  data  and 
certain  assumptions  described  in  the  footnotes  of 
that  exhibit,  the  following  rough  estimates  of  the 
universe  numbers  of  holdings  in  each  of  the  three 
zones  were  made: 


Zone 

Holdings 
(estimated) 

Total 

70,000 

29,000 

24,000 

17,000 

It  was  expected  in  the  early  stage  of  plan- 
ning that  there  would  be  about  100  very  large 
holdings  to  be  classed  as  National  Certainty 
holdings,  and  thus  included  in  the  census  with 
certainty.   It  was  decided  not  to  attempt  a  spe- 
cial sampling  of  Local  Certainty  holdings  in  the 
Northern  Territory. 

3.12  Definition  of  PSU's  and  estimated 
numbers  by  population  size. — As  mentioned  earlier, 
PSU's  will  consist  of  chiefdoms  or  Special  EA's 
as  used  for  enumerator  assignments  in  the  1970 
Population  Census.  The  estimated  numbers  of  PSU's 
(chiefdoms  plus  Special  EA's)  for  each  of  the 
zones  are  as  follows: 


Zone 

Estimated 
PSU's 

Total 

1  969 

821 

677 

471 

3.13  Estimated  total  sample  size  needed . — 
The  total  sample  size  needed  was  determined  by 
the  reliability  requirement  for  the  smallest  zone 
(the  Subic  Mountains),  plus  the  fact  that  a  con- 
stant overall  sampling  rate  for  small  holdings  in 


all  zones  was  desired.  Following  the  same  rea- 
soning as  in  the  Provinces,  a  sample  of  about  400 
small  holdings  in  the  Subic  Mountains  would  produce 
estimates  with  about  the  same  reliability  for 
small  holdings.   With  fewer  certainty  holdings  in 
the  smallest  zone  of  the  Northern  Territory  than 
in  the  average  district  of  the  Provinces,  the  re- 
liability of  estimates  for  all  holdings  will  be 
less  in  the  smallest  zone  than  in  the  average  dis- 
trict .  However,  the  accuracy  requirement  is  also 
less  exacting,  since  coefficients  of  variation  of 
10$  for  most  items  are  acceptable.   It  was  decided 
that  a  l-in-48  overall  sampling  rate  of  small 
holdings  would  be  sufficient  for  the  Subic  Moun- 
tains, where  it  would  produce  a  sample  of  about 
350  small  holdings.  This  rate  was  also  specified 
for  the  other  two  zones,  where  it  will  be  more 
than  sufficient  —  that  is,  it  will  produce  re- 
sults with  an  accuracy  level  above  the  specified 
minimum. 

3.2  Basic  two-stage  sample  plan 

For  the  same  reasons  of  economy  as  prevailed 
in  the  Provinces,  a  two-stage  plan  was  adopted  for 
small  holdings,  with  supplementation  by  the  ap- 
proximately 100  National  Certainty  holdings. 
However,  since  it  was  not  feasible  to  equalize  the 
size  of  PSU's,  variable  sampling  rates  by  size  of 
PSU  were  adopted  at  the  two  stages,  with  a  con- 
stant overall  sampling  rate  for  small  holdings. 
This  plan  helps  to  control  sampling  error  and  to 
equalize  the  interviewing  workload  within  PSU's. 
It  was  decided  that  the  average  number  of  small 
holdings  within  selected  PSU's  should  be  somewhat 
smaller  than  the  number  20  used  in  the  districts, 
because  of  the  larger  amount  of  work  to  be  done, 
considering  the  area  measurements.  Therefore,  the 
number  15  was  chosen  as  the  desirable  size  of  sam- 
ple of  small  holdings  within  the  average  PSU  in 
the  Northern  Territory. 

PSU's  were  classified  into  five  tentatively 
defined  population  size  groups,  as  shown  in  the 
table  on  the  following  page .  The  table  also  shows 
the  expected  average  population  and  the  expected 
average  number  of  holdings  in  each  size  category. 
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Average 

Category 

Population 

Average 

number  of 

size  group 

population 

holdings 

(estimated) 

I 

Less  than  300 

160 

30 

II 

300  to  599 

400 

70 

III 

600  to  999 

700 

130 

IV 

1,000  to  3,499 

1,400 

250 

V 

3 , 500  and  over 

3,700 

700 

After  some  trial-and-error  experimentation, 
it  was  found  that  the  following  rates,  having  con- 
venient intervals,  would  give  approximately  the 
desired  number  of  sample  small  holdings  on  the 
average : 


Sampling  rate 

Category 

For 
PSU's 

Within- 

PSU 

Overall 

I 

1  in  24 

1  in  2 

1  in  48 

II 

1  in  12 

1  in  4 

1  in  48 

III 

1  in  6 

1  in  8 

1  in  48 

IV 

1  in  3 

1  in  16 

1  in  48 

V 

1  in  1 

1  in  48 

L  in  48 

The  principle  used  in  setting  the  tentative 
limits  of  the  population  size  groups  was  that  we 
wish  to  have  a  small  frequency  in  the  largest 
population  size  groups  in  order  to  reduce  the  task 
of  listing  these  large  PSU's.  The  other  class 
limits  and  sampling  rates  were  determined  very 
roughly  by  the  requirement  of  keeping  the  within- 
PSU  sample  around  15.  For  example,  with  a  l-in-2 
rate,  we  need  a  PSU  of  average  population  size  of 
160  (or  an  average  of  30  holdings)  to  get  a  sample 
of  15  holdings.  The  PSU's  of  less  than  300  popu- 
lation have  about  160  persons  on  the  average . 

Later  it  was  necessary  to  modify  these  limits 
to  those  shown  in  exhibit  IV-5-4.  The  final  lim- 
its vary  in  the  different  zones ;  they  were  set 
where  they  would  give  frequencies  in  the  larger 
population  size  groups,  equal  to  or  simple  mul- 
tiples of  the  interval  for  sampling  PSU's.  For 
example,  by  changing  the  lower  limit  of  group  TV 
in  the  Atbara  River  Basin  from  1,000  to  1,500,  we 
obtain  a  frequency  of  3  PSU's  in  the  group  (1,500 
to  3,499  population),  which  facilitates  selection 
of  1  in  3  PSU's. 


This  sample  plan  will  produce  a  total  sample 
distributed  as  follows  among  the  three  zones  (see 
also  exhibit  IV-5-4) . 


Zone 

Sample 
PSU's 

Small 

holdings 

in  sample 

National 

Certainty 

holdings 

Total 

Atbara  River 

101 

44 
33 
24 

1,460 

610 
495 
355 

105 
60 

Subic  Mountains . . 

30 
15 

3.3  Three-stage  sample  for  area  measurements 

Area  measurements  will  be  made  in  a  system- 
atically selected  one -fourth  of  all  sample  small 
holdings  in  the  Northern  Territory.   It  is  assumed 
that  area  data  can  be  obtained  directly  from  the 
respondent  for  National  Certainty  holdings . 

3.4  National  Certainty  holdings 

The  list  of  about  100  National  Certainty  hold- 
ings will  be  compiled  by  the  district  marshals 
following  procedures  similar  to  those  used  in  the 
Provinces .  The  list  will  thus  include  large  hold- 
ings located  anywhere  in  the  Northern  Territory, 
and  not  limited  to  sample  PSU's.  Questionnaires 
will  not  be  sent  by  mail  to  NC  holdings,  as  was 
done  in  the  Provinces . 

4.    SELECTING  THE  SAMPLE  OF  SMALL 
HOLDINGS 

The  steps  required  to  carry  out  the  selection 
of  the  sample  according  to  the  above  general  plan 
include : 

(1)  Listing  and  stratifying  PSU's 

(2)  Selecting  the  first-stage  units  (PSU's) 

(3)  Listing  and  stratifying  holdings 

(4)  Selecting  the  second-stage  units  (small 
holdings) 

(5)  Selecting  the  subsample  of  small  holdings. 

4.1    Listing  and  stratifying  PSU's 

The  first-stage  sampling  frame  for  the  North- 
ern Territory  is  a  list  of  all  the  chiefdoms  and 
Special  EA's  (government  stations,  missions,  mining 
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camps,  fishing  and  trading  centers) .  The  list  re- 
sulting from  the  1970  Population  Census  for  the 
Northern  Territory  will  he  the  has is  for  the  agri- 
culture census  sample .  The  lists  for  the  Popula- 
tion Census  were  derived  from  various  administrative 
lists  in  the  Territory  plus  additional  chiefdoms 
found  by  the  Population  Census  supervisors  which 
were- not  included  in  any  administrative  record. 

The  most  up-to-date  list  of  chiefdoms  is  likely 
to  he  the  one  maintained  by  the  tax  authorities . 
However,  the  Agrostan  Government  sponsors  a  number 
of  activities  for  the  Territory.  Several  agencies 
carry  out  work  even  in  the  very  isolated  areas,  and 
all  have  lists  of  chiefdoms  and  special  areas  under 
their  jurisdiction.   These  lists  should  be  compared 
with  the  Population  Census  list.  Any  chief dom  or 
special  area  not  on  the  latter  list  should  be  added 
to  it. 

Perhaps  the  greatest  improvement  in  the  sam- 
pling frame  can  be  made  by  the  Agriculture  Census 
supervisors  in  their  preparatory  work  for  the  cen- 
sus .  The  supervisors  will  be  hired  a  few  weeks 
early  and  will  be  required  to  canvass  the  area 
assigned  to  them,  to  plot  the  location  of  the  sam- 
ple chiefdoms  and  Special  EA's  on  topographical 
maps  or  sketches,  and  to  spot  the  National  Cer- 
tainty holdings  on  the  map.  Only  after  using  all 
reasonable  sources  of  information,  including  the 
canvass  by  the  census  supervisors,  should  the  list 
of  chiefdoms  be  considered  acceptable  as  a  sampling 
frame. 

The  stratification  within  the  zones  of  the 
Northern  Territory  will  be  carried  out  in  a  man- 
ner similar  to  that  in  the  Provinces,  using  PSU 
cards.  However,  the  cards  will  first  be  sorted 
into  the  population  size  categories  described  above 
(see  exhibits  17-5-1,  17-5-2,  and  IV-5-3) .   In 
sorting,  the  original  geographic  order  should  be 
maintained  within  categories. 

The  precise  population  of  some  of  the  chief- 
doms might  not  be  known.   If  so,  a  judgment  should 
be  made  of  the  proper  size  category.   If  some  PSU's 
are  incorrectly  classified  by  size,  the  effect  on 


the  reliability  of  the  sample  will  not  be  very 
serious  unless  it  happens  often.   Within  each 
size  category,  the  PSU's  should  be  ordered  in 
geographic  sequence  or  possibly  by  using  addition- 
al criteria  such  as  those  used  in  the  Provinces . 
It  may  be  noted  that  this  stratification  process 
is  different  from  that  used  in  the  Provinces .  The 
latter  did  not  require  the  formation  of  definite 
strata  or  groups;  the  PSU's  were  simply  placed  in 
a  list  with  similar  PSU's  listed  near  each  other. 
The  use  of  strata  consisting  of  definite  groups 
in  the  Northern  Territory  is  necessary  because  the 
PSU's  are  chiefdoms  and  Special  EA's  of  highly 
variable  sizes  and  they  will  be  sampled  at  variable 
rates.  As  mentioned  in  the  description  of  the 
basic  sample  plan,  the  holdings  within  these  groups 
will  also  be  sampled  at  variable  rates  so  that  the 
overall  sampling  rate  will  be  constant.  Since  the 
Northern  Territory  will  have  no  Local  Certainty 
holdings,  the  PSU's  can  be  selected  at  the  variable 
rates  without  increasing  the  number  of  weights 
required  to  make  estimates . 

4.2   Selecting  the  PSU's 

The  five  different  size  categories  for  the 
PSU's  of  the  Northern  Territory  and  the  sampling 
rates  are  given  below.     A  systematic  sample  will 
be  selected,   using  a  different  random  start  in 
each  category,   but  continuing  from  zone  to  zone 
within  a  category.     A  worksheet  should  be  prepared 
similar  to  the  one  used  in  the  selection  of  the 
sample  PSU's  in  the  Provinces   (refer  to  exhibit 
IV-3-8  omitting  columns  5  and  6) . 


Category 

Sampling  rate 
for  PSU's 

I 

1  in  24 

II 

1  in  12 

III 

1  in  6 

IV 

1  in  3 

V 

1  in  1 

An  alternative  way  of  classifying  the  chiefdoms  would 
oe   to  have  four  descriptive  groups,    such  as   small,   medium, 
large,    and  very  large.      The  gains  from  this  type  of   stra- 
tification should  be  nearly  equal  to  those  gains  from 
stratification  with  known  population  sizes. 
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4.3  Listing  and  stratifying  holdings 

As  in  the  Provinces,  the  small  holdings  within 
the  sample  PSU's  in  the  Northern  Territory  will  he 
listed  and  stratified;  however,  the  criteria  for 
stratification  and  the  sampling  procedure  will  be 
different.  See  exhibit  IV-5-6  (illustration  of  a 
filled  Listing  Book  for  a  sample  PSU  in  the  North- 
ern Territory)  and  exhibit  IV-5-5  (sketch  of  the 
location  of  the  housing  units  in  the  PSU) . 

Special  care  must  be  taken  in  the  listing  of 
holdings  within  sample  PSU's  in  the  Territory. 
Chief dom  boundaries  are  not  well  defined  and  hous- 
ing units  which  appear  to  be  just  outside  the 
boundaries  might  in  fact  belong  to  the  chief dom. 
In  addition,  some  social  customs  of  the  Territory 
complicate  the  identification  of  holdings .  Nomads 
may  also  present  difficulties .  They  will  be  sam- 
pled where  their  housing  units  are  located  at  the 
time  of  enumeration. 

One  major  difference  in  listing  in  the  Ter- 
ritory as  compared  with  the  Provinces  is  reflected 
on  the  listing  sheet  (exhibit  IV-5-6)  for  the 
Territory.  The  size  categories  which  were  used 
to  stratify  the  holdings  in  the  Provinces  have 
been  omitted  from  the  listing  sheet  of  the  Terri- 
tory because  the  holders  are  not  expected  to  be 
able  to  provide  information  or  size.  Instead,  in- 
formation is  obtained  on  number  of  livestock — a 
figure  which  the  respondents  can  usually  provide. 
Numbers  of  cattle,  sheep  and  goats  are  recorded  in 
columns  8  and  9,  the  holdings  are  stratified  by 
whether  or  not  they  have  more  than  specified  num- 
bers of  livestock,  and  the  sample  holdings  are 
selected  separately  from  these  categories. 

The  definition  of  holding  and  the  selection 
process  are  also  slightly  different  in  the  North- 
ern Territory  where  there  is  no  minimum  size  of 
holding  and  there  are  no  Local  Certainty  holdings . 

4.4  Selecting  the  holdings  within  sample  PSU's 

The  sample  of  holdings  in  the  Territory  will 
be  selected  by  the  enumerator  rather  than  by  the 
supervisor  (see  section  5  below) .  The  enumerator 


will  visit  all  housing  units  in  a  PSU,  list  the 
holdings  associated  with  each  housing  unit,  record 
the  number  of  cattle,  sheep  and  goats  on  each  hold- 
ing in  columns  8  and  9  of  the  listing  sheet,  and 
separate  the  holdings  with  large  numbers  of  live- 
stock from  those  with  small  numbers  by  filling 
columns  11  and  12. 

To  select  the  sample,  the  enumerator  will  be 
provided  with  two  numbers --a  "start  with"  and  a 
"take  every"  number,  shown  for  "PSU  sampling"  in 
item  (a)  on  the  cover  of  the  Listing  Book  (see 
exhibit  IV-5-6) .  Both  of  these  numbers  might  vary 
from  PSU  to  PSU.  The  enumerator  will  count  the 
holdings  with  large  numbers  of  livestock  ("X"  in 
column  11)  until  he  reaches  his  "start  with"  num- 
ber, which  will  be  the  first  sample  holding.  Then 
the  sample  interval,  or  "take  every"  number,  is 
added  repeatedly  to  identify  succeeding  sample 
holdings  of  this  type  shown  in  column  11.  When 
the  end  of  this  category  is  reached,  the  sampling 
continues  to  the  other  category  ("X"  in  column  12) 
until  that  category  has  been  sampled. 

4.5   Selecting  the  sub  sample  for  area 
measurement 

After  selecting  the  holdings,  the  enumerator 
will  select  the  subsample  of  holdings  for  area 
measurement.  The  original  sample  holdings  will 
be  shown  by  the  questionnaire  numbers  assigned  to 
them  in  columns  14-  and  15  of  the  Listing  Book. 
The  subsample  of  holdings  is  selected  by  taking 
one  of  the  first  four  sample  holdings  randomly 
and  then  selecting  every  fourth  sample  holding 
thereafter.  Instructions  for  making  the  selection 
will  be  given  for  "area  measurement  sampling"  in 
item  (a)  on  the  cover  of  the  Listing  Book  for  each 
PSU. 

5.    ENUMERATION  PLAN 

As  was  indicated  above,  conditions  in  the 
Territory  are  such  that  the  enumeration  plan 
used  for  the  Provinces  requires  substantial  modi- 
fication for  use  in  the  Northern  Territory. 
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5.1   Interviewing  and  supervision 

There  will  be  8  or  9  supervisors,  each  in 
charge  of  4  or  5  enumerators .  Due  to  the  unfavor- 
able travel  conditions,  the  supervisor  will  be  un- 
able to  plan  the  time  of  his  visits  to  the  enumer- 
ator so  that  he  can  select  the  sample  as  in  the 
Provinces .  There  could  be  serious  delays  in  com- 
pleting the  field  work  if  the  enumerators  were 
required  to  wait  until  the  supervisor  was  avail- 
able to  select  the  sample  holdings .  The  supervisor 
will  review  the  sample  selection  in  every  PSU, 
however,  and  will  reselect  the  sample  in  those 
cases  where  the  proper  sampling  procedures  were 
not  followed. 

The  enumerator  will  be  assigned  a  single  PSU 
( chief dom  or  Special  EA)  at  a  time.  He  will  visit 
nis  sample  area,  list  the  holdings,  and  select  the 
sample  holdings,  using  the  sampling  instructions 
for  his  PSU.  The  supervisor  will  visit  the  enu- 
merator as  often  as  possible  but,  because  of 
travel  difficulties,  he  may  not  always  be  able  to 
follow  a  predetermined  schedule.  Since  PSU's  will 
not  be  made  of  equal  size  as  in  the  Provinces,  the 
time  needed  for  listing  will  vary  greatly  for  the 
PSU's  in  the  Northern  Territory.  The  sample  design 
tends  to  equalize  the  interviewing  workload  but 
not  the  listing  workload. 

The  National  Certainty  holdings  will  be  inter- 
viewed generally  by  the  enumerator  closest  to  the 
holding.  Because  of  poor  communication  systems, 
the  questionnaires  will  not  be  mailed  to  the  hold- 
ers as  in  the  Provinces.  For  the  area  of  cropland, 
the  information  will  be  obtained  by  a  direct  ques- 
tion, since  it  is  assumed  that  holders  of  large 
tracts  of  land  would  be  able  to  supply  such  infor- 
mation (objective  measurement  will  not  be  applied 
to  the  National  Certainty  holdings). 

Since  the  enumeration  plan  requires  the  enu- 
merator to  work  independently,  a  careful  review  of 
the  enumerator's  work  by  the  supervisor  is  most  im- 
portant .   If  the  review  indicates  that  the  enumera- 
tor is  doing  good  work,  a  second  PSU  or  several  Na- 
tional Certainty  holdings  may  be  assigned  to  him. 


5.2  Objective  measurement  of  area 

Objective  measurement  of  the  crop  area  in  the 
holding  will  be  carried  out  at  a  randomly  selected 
one-fourth  of  the  sample  small  holdings  in  the 
Northern  Territory.   In  order  that  the  measure- 
ments can  be  done  quickly  and  with  reasonable 
accuracy,  a  simple  method  of  making  the  measure- 
ments is  needed. 

5.21  Making  the  sketch  of  cropland  in  the 
holding. — The  greatest  difficulty  in  measuring  an 
area  objectively  is  in  preparing  an  accurate  sketch 
of  the  land.  After  a  sketch  is  made,  the  number 
of  hectares  (or  other  unit)  can  be  determined  in 
several  ways . 

In  the  Territory,  the  holding  will  often  be 
divided  into  many  plots  of  irregular  shapes .  Each 
of  the  plots  must  be  sketched  separately  and  the 
crop  areas  clearly  outlined  and  labelled  to  permit 
identification  of  the  five  important  crops  sepa- 
rately (wheat,  rice,  cotton,  coffee,  and  sisal). 
The  sketch  will  be  made  on  the  graph  portion  of 
page  4-  of  the  Agriculture  Questionnaire  for  the 
Northern  Territory.  The  graph  is  divided  into 
equal-sized  squares --representing  20  meters  on 
each  side;  each  square  contains  16  dots  to  guide 
in  determining  fractional  parts  of  a  square.  The 
only  equipment  needed  will  be  a  reasonably  accurate 
compass,  measuring  chain,  and  a  dozen  or  more 
marking  poles.  The  poles  should  be  red  or  some 
other  highly  visible  color,  although  some  of  the 
enumerators  will  find  it  necessary  to  cut  poles 
from  nearby  bush. 

The  enumerator  will  begin  the  measurement  at 
one  corner  of  the  sample  holding.  Using  the  com- 
pass, chain,  and  measuring  poles,  he  will  fit  a 
rectangle  to  include  as  much  of  the  crop  area  as 
possible  and  sketch  the  rectangle  onto  the  paper. 
With  the  sides  of  the  rectangle  as  a  base  line,  he 
will  sketch  the  boundaries  of  the  area  by  measuring 
the  distance  between  the  base  line  and  the  boundary 
at  intervals  of  20  meters  and  marking  the  measured 
points  on  the  sketch.  The  points,  so  marked  off  on 
the  sketch  will  be  joined  with  straight  lines  which 
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will  approximate  the  area  boundaries .  The  Enumer- 
ator' s  Reference  Manual  (Northern  Territory),  Form 
A-51,  explains  the  area  measurement  technique  in 
greater  detail. 

When  the  crop  area  consists  of  separate  plots, 
the  enumerator  should  measure  and  sketch  each  plot 
separately. 

5.22  Determining  the  area. — The  actual  com- 
putation of  the  area  will  be  done  by  the  staff  in 
the  Central  Office  of  the  NSO.  The  number  of  com- 
plete squares  within  the  boundaries  will  be  counted 
and  multiplied  by  the  known  area  represented  by 
each  square.  For  squares  which  are  only  partially 
inside  the  boundaries,  the  area  will  be  determined 
by  estimating  the  proportion  of  dots  inside  the 
boundaries,  multiplying  the  proportion  by  the 
known  area  of  the  square,  summing  over  all  partial 
squares  and  adding  it  to  the  area  included  in  whole 
squares.  The  procedure  will  be  followed  for  all 
plots  in  the  holding.  Finally,  the  area  of  the 
plots  will  be  added  together  for  the  total  crop 
area  of  the  holding. 

If  the  squares  of  the  rectangle  are  of  a  suit- 
able size,  the  method  could  be  used  to  make  esti- 
mates of  yield.  A  sample  of  the  squares  from  each 
crop  could  be  selected  and  staked  out  early  in  the 
growing  season  for  measurement  at  harvest  time. 
Yield  estimates  could  then  be  made  for  the  Terri- 
tory or  for  the  zones,  but  not  for  individual 
holdings .  Estimation  of  yields  is  not  recommended 
as  a  census  activity,  however,  due  to  the  heavy 
costs  of  making  several  visits  to  the  plot  during 
the  year.  Data  on  yields  should  preferably  be 
collected  in  the  annual  agriculture  statistics 
program . 

6e    PREPARATION  OF  ESTIMATES 

Estimates  in  the  Northern  Territory  will  be 
made — after  adjusting  for  non -interviews — by 
weighting  the  sample  results  for  the  small  hold- 
ings by  48  and  adding  the  results  for  the  National 
Certainty  holdings .  The  area  in  cropland ,  which 
will  be  measured  for  a  fourth  of  the  small  hold- 
ings, will  be  obtained  by  weighting  the  appropriate 


entries  from  the  subsample  by  192  and  adding  the 
corresponding  entries  for  the  National  Certainty 
holdings .  The  estimation  method  is  described  in 
chapter  6  of  this  unit.  The  first-stage  ratio 
adjustment  described  in  that  chapter  may  also  be 
used  in  the  Northern  Territory. 

Census  objectives  for  characteristics  invol- 
ving area  will  be  quite  limited.  The  distribution 
of  the  holdings  by  size  of  crop  area  can  be  ob- 
tained for  each  of  the  zones.  Any  cross-classi- 
fication of  size  data  with  tenure  or  other  char- 
acteristics should  be  avoided  at  the  zone  level. 
However,  such  data  should  be  of  sufficient  relia- 
bility for  the  Territory  as  a  whole  if  the  num- 
ber of  cross -classifications  is  held  to  a  mini- 
mum. 

Variance  estimates  can  be  made  using  the 
procedures  described  in  chapter  7  of  this  unit. 

7.    SUMMARY 

Estimates  for  each  of  the  three  zones  in  the 
Northern  Territory  are  required.  The  sample  for 
the  Territory  was  designed  so  that  most  estimates 
could  be  made  with  no  more  than  about  a  10$  co- 
efficient of  variation.  The  sample  holdings  will 
be  selected  from  sample  PSU's  ( chief doms  and 
Special  EA's).  The  PSU's  will  be  selected  with 
different  probabilities  depending  on  their  popu- 
lation. Holdings  will  also  be  selected  at  vari- 
able rates  so  that  the  overall  sampling  rate  is 
the  same  for  every  holding  except  the  National 
Certainty  holdings.  A  total  of  about  1,500  small 
holdings  and  about  100  National  Certainty  holdings 
will  be  selected  in  the  sample.  There  will  be  no 
Local  Certainty  holdings . 

Objective  measurement  of  the  area  of  crop- 
land in  the  holding  will  be  carried  out  for  a 
subsample  (one-fourth)  of  the  small  holdings. 
Area  in  cropland  will  be  reported  for  all  the 
National  Certainty  holdings,  but  will  not  be  meas- 
ured objectively. 

Exhibit  IY-5-4  summarizes  the  sample  size  for 
the  Northern  Territory. 
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1.    INTRODUCTION 

When  the  enumeration  and  area  measurements 
have  been  completed,  and  all  the  questionnaires 
checked  in  and  edited,  there  will  be  some  sample 
holdings  for  which  no  usable  questionnaires  were 
obtained.  This  chapter  describes  a  method  of  ad- 
justing the  sample  for  such  "noninterviews "  and 
for  questionnaires  with  missing  area  measurements. 
The  noninterview  adjustment  does  not  apply  to  in- 
dividual missing  or  inconsistent  entries  in  the 
questionnaire,  which  in  some  cases  are  corrected 
by  editing  or  imputation  (supplying  the  data  ac- 
cording to  prescribed  rules)  and  in  other  cases 
are  allowed  to  appear  in  a  "Not  reported"  category. 

This  chapter  also  describes  a  simple  method  of 
expanding  the  adjusted  sample  results  to  obtain 
estimates  for  the  entire  universe  of  holdings. 
Since  three  probabilities  of  selection  were  used 
in  the  Provinces,  there  are  three  expansion  fac- 
tors (or  weights)  to  be  used  in  making  estimates 
for  the  districts  and  the  Provinces.  Similarly, 
there  are  three  weights  for  the  Northern  Territory. 

2.    NONINTER  VIE  W  ADJUSTMENT 

In  an  agriculture  census,  it  is  practically 
impossible  to  obtain  interviews  from  every  hold- 
ing which  is  specified  for  enumeration.  Unless 
corrected  for,  the  noninterviewed  cases  could 
cause  a  serious  bias,  especially  in  estimates  of 
totals  and  aggregates.  If  the  number  of  all  such 
cases  is  not  more  than  about  5%   (10/6  at  the  very- 
most),  a  simple  noninterview  adjustment  may  give 
satisfactory  results. 

2.1    Types  of  noninterviews 

A  holding  may  be  a  "noninterview"  for  a  num- 
ber of  reasons.  The  noninterview  adjustment  is 
limited  to  those  holdings  for  which  Agriculture 


Questionnaires  would  have  been  filled  if  a  com- 
pleted interview  had  been  obtained.  These  are 
cases  which  are  reported  as: 

(1)  No  one  at  home  at  time  of  call 

(2)  Temporarily  absent  for  a  period  of  time 

(3)  Refusal 

(4)  No  qualified  respondents  available. 

There  may  also  be  a  few  cases  where  the  question- 
naire is  lost,  destroyed,  or  so  poorly  filled  in 
that  it  is  not  usable.  These  are  also  treated  as 
noninterviews  if  it  is  determined  that  a  return 
visit  would  be  impractical. 

2.2  Adjustment  for  noninterviewed  small 
holdings 

To  adjust  for  noninterviews  of  sample  small 
holdings,  data  for  an  interviewed  small  holding  in 
the  same  PSU  and  preferably  in  the  same  size-of- 
holding  group  (as  shown  in  the  Listing  Book)  will  be 
substituted  for  the  noninterviewed  holding.  Suppose, 
for  example,  the  fourth  sampled  holding  shown  in  ex- 
hibit IV-4-1  is  reported  as  a  noninterview  because 
there  was  no  one  at  home .  We  select  the  next  sample 
holding  in  the  same  size  category  (the  13^n  sample 
holding  in  exhibit  IV-4-1)  and  duplicate  it,  either 
by  making  up  an  identical  questionnaire  or  by  dupli- 
cating the  corresponding  cards  for  that  holding. 
This  has  the  effect  of  doubling  the  weight  for  the 
holding  which  is  duplicated.   If  a  computer  is  used, 
the  duplication  of  questionnaires  or  cards  is  unnec- 
essary; the  procedure  will  be  to  simply  double  the 
weight  of  the  interviewed  questionnaire.   If  there 
are  two  noninterviews  in  the  same  PSU,  a  second  hold- 
ing (other  than  the  one  already  duplicated)  is  selec- 
ted in  a  similar  way,  and  this  holding  is  duplicated . 

A  slight  variation  in  this  procedure  for 
small  holdings  is  necessary  in  the  Northern  Ter- 
ritory, where  every  fourth  sample  small  holding 
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is  specified  for  area  measurements .  To  keep  the 
area  measurement  sample  self -weighted,  a  noninter- 
view  among  the  holdings  specified  for  area  measure- 
ments will  require  duplication  of  another  holding 
having  area  measurements.  Similarly,  for  holdings 
not  requiring  area  measurements,  interviewed  hold- 
ings without  area  measurements  should  be  substi- 
tuted for  noninterviewed  cases . 

2.3  Adjustment  for  noninterviewed  certainty 
holdings 

As  described  earlier,  a  greater  effort  will 
be  made  to  obtain  census  data  from  Local  Certainty- 
holdings,  and  especially  from  National  Certainty 
holdings.  No  doubt  there  will  be  some  noninter- 
views  remaining  in  both  categories. 

For  Local  Certainty  holdings  the  noninterview 
adjustment  will  be  made  in  the  same  way  as  for  small 
holdings — by  duplicating  another  Local  Certainty 
holding  in  the  same  size  group.  Where  possible  the 
substitute  holding  will  be  in  the  same  PSU,  but  it 
may  be  necessary  to  use  an  interviewed  Local  Cer- 
tainty holding  from  another  PSU  in  the  same  district, 
In  either  case,  the  selection  should  be  made  accord- 
ing to  a  predetermined  procedure,  such  ej  taking  the 
next  numbered  Local  Certainty  holding  in  the  same 
PSU  or,  where  necessary,  in  the  next  numbered  PSU. 

For  National  Certainty  holdings,  we  can  ex- 
pect that  there  will  be  very  few  interviews  avail- 
able for  duplication  in  the  same  PSU  or  even 
within  the  same  district.  The  interviewed  Nation- 
al Certainty  holdings  may  also  be  very  different 
from  those  missed.  Therefore,  duplication  will 
usually  not  be  feasible.  Instead,  data  for  the 
questionnaire  will  be  imputed  on  the  basis  of  the 
best  available  data. 

3.    ADJUSTMENT  FOR  MISSING  AREA 
MEASUREMENTS 

In  some  cases  in  the  Northern  Territory,  the 
interview  portion  of  a  questionnaire  will  be  usable 
but  the  area  measurement  information  will  be  missing 
or  not  usable.  These  will  be  treated  the  same  as 
other  cases  of  individual  missing  or  not  usable 


entries.  The  area  measurement  data  will  be  im- 
puted on  the  basis  of  the  best  information  avail- 
able, and  the  imputed  entries  should  be  consis- 
tent with  production  data  for  the  corresponding 
crops  in  Section  IV  of  the  same  questionnaire. 

4.    IDENTIFICATION  OF  DUPLICATE  CARDS 

Duplicate  cards  will  be  punched,  or  double 
weights  assigned,  primarily  for  noninterviews. 
Another  possible  reason  is  to  compensate  for  un- 
usual subsampling  rates  used  for  small  holdings 
in  PSU' s  having  a  very  large  number  of  small  hold- 
ings. This  would  require  duplicating,  or  at  most 
triplicating,  all  the  small  holdings  in  a  PSU, 
since  subsampling  rates  in  those  cases  are  either 
1/2  or  1/3  of  the  usual  rate.   (See  section  4.3 
in  chapter  U   of  Unit  IV.)   Replication  eliminates 
the  need  for  additional  weight  groups  in  these 
cases. 

When  a  card  is  duplicated  (or  a  weight  is 
doubled),  a  code  should  be  punched  in  the  dupli- 
cate card  to  show  that  it  is  a  duplicate  and  to 
indicate  the  reason  for  the  duplication.  Such  a 
code  will  enable  the  NSO  staff  to  analyze  the 
effect  of  the  duplications  on  the  census  results. 

Sometimes  a  different  color  is  used  for 
duplicated  cards  (or  cards  with  a  doubled  weight) 
for  easy  visual  identification  in  the  deck.  A 
code  punched  in  the  card  is  still  necessary,  how- 
ever, in  order  to  sort  the  duplicated  cards  from 
the  main  deck. 

5.    ALTERNATIVE  ADJUSTMENT  PROCEDURES 

The  procedure  outlined  for  handling  nonin- 
terviews was  considered  to  be  the  most  practical 
for  Agrostan,  and  the  simplest  to  carry  out. 
Another  method  of  substitution  is  to  select  the 
holding  to  be  duplicated  at  random,  from  those 
interviewed  in  the  same  size  group  in  the  PSU. 
Any  reasonable  method  of  substitution  may  be  used 
if  it  is  predetermined  and  the  number  of  cases  is 
not  very  large .  If  the  noninterview  rate  is  large , 
no  method  of  substitution  can  be  very  satisfactory. 
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A  second  alternative  is  to  increase  the  weight 
of  all  the  "interviewed"  questionnaires  in  the  PSU 
where  the  noninterview  occurs;  the  weight  would  be 
increased  to  the  extent  needed  to  offset  the  non- 
interviews  in  the  PSU.  In  effect,  this  procedure 
substitutes  average  characteristics  for  the  miss- 
ing data,  which  is  theoretically  more  desirable 
than  multiplying  some  cards  in  the  PSU  by  2  and 
not  multiplying  others.   On  the  other  hand,  the 
procedure  of  changing  the  weight  would  result  in 
many  different  numbers  for  the  weights,  and  would 
complicate  the  tabulations;  it  is  usually  not  fea- 
sible unless  a  computer  is  available  for  process- 
ing the  data. 

6.    POSSIBLE  BIASES  IN  ADJUSTMENT 
PROCEDURES 

Duplication  of  questionnaires  or  cards  to 
adjust  for  noninterviews  will  lead  to  a  sample 
set  of  questionnaires  or  cards  for  the  same  num- 
ber of  holdings  as  if  usable  interviews  were  ob- 
tained from  all  sample  holdings.  However,  some 
serious  problems  may  still  arise  because  of  the 
noninterviews. 

(1)  The  noninterviews  may  be  concentrated  in  cer- 
tain subgroups  of  the  population,  such  as 
holdings  belonging  to  holders  who  are  either 
very  young  or  elderly.  For  such  a  group,  we 
may  be  representing  these  holdings  by  others 
which  have  rather  different  characteristics. 

(2)  If  the  noninterview  rate  (including  lost 
questionnaires)  becomes  more  than  about  5%> 
efforts  should  be  made  to  reduce  it  by  re- 
training enumerators  and  requiring  more  call- 
backs. 

7.    ESTIMATES  FROM  THE  SAMPLE 

In  planning  for  a  census  or  survey,  table 
formats  are  prepared  in  advance  showing  the  kinds 
of  estimates  to  be  made.   (Tables  for  most  of  the 
estimates  which  will  be  made  for  Agrostan  have 
been  prepared  and  are  given  in  the  appendix  of 
Unit  III.)  After  the  cards  have  been  edited, 
coded,  and  duplicated  for  noninterviews,  they  are 
tabulated  and  weighted  to  provide  the  data  for 
the  cells  shown  in  the  table  formats .  From  the 
table  formats  for  Agrostan,  it  can  be  seen  that 


the  required  estimates  are  of  the  following  types : 

(1)  Estimates  of  numbers  of  holdings  and  of 
aggregates 

(2)  Estimates  of  ratios,  percentages,  and  aver- 
ages; and  estimates  of  proportions  of  hold- 
ings having  an  attribute . 

7.1    Estimates  of  numbers  of  holdings  and  of 
aggregates 

In  the  Provinces,  estimates  of  totals  or 
aggregates  (such  as  total  numoer  of  holdings  for  a 
district  or  total  production  of  maize)  will  be  ob- 
tained as  follows :   If  cards  are  duplicated  for 
noninterviews,  the  sample  totals  for  Local  Cer- 
tainty holdings  will  be  multiplied  by  4  and  the 
sample  totals  for  small  holdings  will  be  multiplied 
by  16.   If  weights  of  individual  holdings  were 
adjusted  for  noninterviews,  there  would  be  addi- 
tional weight  groups.  The  weighted  totals  for 
the  two  groups  of  holdings  (the  small  and  the  Lo- 
cal Certainty  holdings)  will  be  combined  and  added 
to  the  total  for  the  National  Certainty  holdings 
located  in  the  district.  The  resulting  estimates 
of  district  totals  will  be  added  to  obtain  esti- 
mates for  the  Provinces  as  a  whole. 

Similar  methods  will  be  used  to  obtain  esti- 
mated totals  for  the  zones  in  the  Northern  Ter- 
ritory. However,  the  weight  for  small  holdings 
will  be  48.  The  totals  for  the  National  Certainty 
holdings  will  be  added  to  the  weighted  totals  for 
the  small  holdings  to  obtain  estimated  totals  for 
each  zone.   (There  will  be  no  Local  Certainty  hold- 
ings in  the  Northern  Territory.)  The  subsample 
of  holdings  for  which  area  measurements  are  made 
will  have  a  special  weight  for  these  measurements, 
which  is  4  x  48  or  192. 

If  the  tabulation  is  done  on  an  electronic 
computer  the  weighting  can  be  accomplished  as 
part  of  the  overall  tabulation  process.  On  con- 
ventional tabulating  equipment  the  National  Cer- 
tainty holdings,  the  Local  Certainty  holdings, 
and  the  small  holdings  would  all  be  tabulated  and 
weighted  separately  for  each  cell  in  the  tables, 
and  the  weighted  totals  for  the  different  types  of 
holdings  would  be  added  together,  cell  by  cell. 
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Average 

Category 

Population 
size  group 

Average 
population 

number  of 

holdings 

(estimated) 

I 

Less  than  300 

160 

30 

II 

300  to  599 

400 

70 

III 

600  to  999 

700 

130 

IV 

1,000  to  3,499 

1,400 

250 

V 

3 , 500  and  over 

3,700 

700 

After  some  trial -and -error  experimentation, 
it  was  found  that  the  following  rates,  having  con- 
venient intervals,  would  give  approximately  the 
desired  number  of  sample  small  holdings  on  the 
average : 


Sampling  rate 

Category 

For 
PSU's 

Within- 
PSU 

Overall 

I 

II 
III 
IV 

V 

1  in  24 

1  in  12 

1  in  6 

1  in  3 

1  in  1 

1  in  2 

1  in  4 

1  in  8 

1  in  16 

1  in  48 

1  in  48 
1  in  48 
1  in  48 
1  in  48 
1  in  48 

The  principle  used  in  setting  the  tentative 
limits  of  the  population  size  groups  was  that  we 
wish  to  have  a  small  frequency  in  the  largest 
population  size  groups  in  order  to  reduce  the  task 
of  listing  these  large  PSU's.  The  other  class 
limits  and  sampling  rates  were  determined  very 
roughly  by  the  requirement  of  keeping  the  within- 
PSU  sample  around  15.  For  example,  with  a  l-in-2 
rate,  we  need  a  PSU  of  average  population  size  of 
160  (or  an  average  of  30  holdings)  to  get  a  sample 
of  15  holdings.  The  PSU's  of  less  than  300  popu- 
lation have  about  160  persons  on  the  average . 

Later  it  was  necessary  to  modify  these  limits 
to  those  shown  in  exhibit  IV-5-4.  The  final  lim- 
its vary  in  the  different  zones ;  they  were  set 
where  they  would  give  frequencies  in  the  larger 
population  size  groups,  equal  to  or  simple  mul- 
tiples of  the  interval  for  sampling  PSU's.  For 
example,  by  changing  the  lower  limit  of  group  TV 
in  the  Atbara  River  Basin  from  1,000  to  1,500,  we 
obtain  a  frequency  of  3  PSU's  in  the  group  (1,500 
to  3,499  population),  which  facilitates  selection 
of  1  in  3  PSU's. 


This  sample  plan  will  produce  a  total  sample 
distributed  as  follows  among  the  three  zones  (see 
also  exhibit  IV-5-4) . 


Zone 

Sample 
PSU's 

Small 

holdings 

in  sample 

National 

Certainty 

holdings 

Total 

Atbara  River 

101 

44 
33 
24 

1,460 

610 
495 
355 

105 
60 

Subic  Mountains . . 

30 
15 

3.3  Three-stage  sample  for  area  measurements 

Area  measurements  will  be  made  in  a  system- 
atically selected  one -fourth  of  all  sample  small 
holdings  in  the  Northern  Territory.   It  is  assumed 
that  area  data  can  be  obtained  directly  from  the 
respondent  for  National  Certainty  holdings . 

3.4  National  Certainty  holdings 

The  list  of  about  100  National  Certainty  hold- 
ings will  be  compiled  by  the  district  marshals 
following  procedures  similar  to  those  used  in  the 
Provinces .  The  list  will  thus  include  large  hold- 
ings located  anywhere  in  the  Northern  Territory, 
and  not  limited  to  sample  PSU's.  Questionnaires 
will  not  be  sent  by  mail  to  NC  holdings,  as  was 
done  in  the  Provinces . 

4.    SELECTING  THE  SAMPLE  OF  SMALL 
HOLDINGS 

The  steps  required  to  carry  out  the  selection 
of  the  sample  according  to  the  above  general  plan 
include : 

(1)  Listing  and  stratifying  PSU's 

(2)  Selecting  the  first-stage  units  (PSU's) 

(3)  Listing  and  stratifying  holdings 

(4)  Selecting  the  second-stage  units  (small 
holdings) 

(5)  Selecting  the  subsample  of  small  holdings. 

4.1    Listing  and  stratifying  PSU's 

The  first-stage  sampling  frame  for  the  North- 
ern Territory  is  a  list  of  all  the  chiefdoms  and 
Special  EA's  (government  stations,  missions,  mining 
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camps,  fishing  and  trading  centers).  The  list  re- 
sulting from  the  1970  Population  Census  for  the 
Northern  Territory  will  be  the  basis  for  the  agri- 
culture census  sample.  The  lists  for  the  Popula- 
tion Census  were  derived  from  various  administrative 
lists  in  the  Territory  plus  additional  chiefdoms 
found  by  the  Population  Census  supervisors  which 
were- not  included  in  any  administrative  record. 

The  most  up-to-date  list  of  chiefdoms  is  likely 
to  be  the  one  maintained  by  the  tax  authorities . 
However,  the  Agrostan  Government  sponsors  a  number 
of  activities  for  the  Territory.  Several  agencies 
carry  out  work  even  in  the  very  isolated  areas,  and 
all  have  lists  of  chiefdoms  and  special  areas  under 
their  jurisdiction.   These  lists  should  be  compared 
with  the  Population  Census  list.  Any  chief dom  or 
special  area  not  on  the  latter  list  should  be  added 
to  it . 

Perhaps  the  greatest  improvement  in  the  sam- 
pling frame  can  be  made  by  the  Agriculture  Census 
supervisors  in  their  preparatory  work  for  the  cen- 
sus .  The  supervisors  will  be  hired  a  few  weeks 
early  and  will  be  required  to  canvass  the  area 
assigned  to  them,  to  plot  the  location  of  the  sam- 
ple chiefdoms  and  Special  EA's  on  topographical 
maps  or  sketches,  and  to  spot  the  National  Cer- 
tainty holdings  on  the  map.  Only  after  using  all 
reasonable  sources  of  information,  including  the 
canvass  by  the  census  supervisors,  should  the  list 
of  chiefdoms  be  considered  acceptable  as  a  sampling 
frame. 

The  stratification  within  the  zones  of  the 
Northern  Territory  will  be  carried  out  in  a  man- 
ner similar  to  that  in  the  Provinces,  using  PSU 
cards.  However,  the  cards  will  first  be  sorted 
into  the  population  size  categories  described  above 
(see  exhibits  IV-5-1,  17-5-2,  and  IV-5-3) .   In 
sorting,  the  original  geographic  order  should  be 
maintained  within  categories . 

The  precise  population  of  some  of  the  chief- 
doms might  not  be  known.   If  so,  a  judgment  should 
be  made  of  the  proper  size  category.   If  some  PSU's 
are  incorrectly  classified  by  size,  the  effect  on 


the  reliability  of  the  sample  will  not  be  very 
serious  unless  it  happens  often.   Within  each 
size  category,  the  PSU's  should  be  ordered  in 
geographic  sequence  or  possibly  by  using  addition- 
al criteria  such  as  those  used  in  the  Provinces . 
It  may  be  noted  that  this  stratification  process 
is  different  from  that  used  in  the  Provinces .  The 
latter  did  not  require  the  formation  of  definite 
strata  or  groups;  the  PSU's  were  simply  placed  in 
a  list  with  similar  PSU's  listed  near  each  other. 
The  use  of  strata  consisting  of  definite  groups 
in  the  Northern  Territory  is  necessary  because  the 
PSU's  are  chiefdoms  and  Special  EA's  of  highly 
variable  sizes  and  they  will  be  sampled  at  variable 
rates .  As  mentioned  in  the  description  of  the 
basic  sample  plan,  the  holdings  within  these  groups 
will  also  be  sampled  at  variable  rates  so  that  the 
overall  sampling  rate  will  be  constant.  Since  the 
Northern  Territory  will  have  no  Local  Certainty 
holdings,  the  PSU's  can  be  selected  at  the  variable 
rates  without  increasing  the  number  of  weights 
required  to  make  estimates . 

4.2   Selecting  the  PSU's 

The  five  different  size  categories  for  the 
PSU's  of  the  Northern  Territory  and  the  sampling 
rates  are  given  below.     A  systematic  sample  will 
be  selected,   using  a  different  random  start  in 
each  category,   but  continuing  from  zone  to  zone 
within  a  category.     A  worksheet  should  be  prepared 
similar  to  the  one  used  in  the  selection  of  the 
sample  PSU's  in  the  Provinces    (refer  to  exhibit 
IV-3-8  omitting  columns  5  and  6) . 


Category 

Sampling  rate 
for  PSU's 

I 

1  in  24 

II 

1  in  12 

III 

1  in  6 

IV 

1  in  3 

V 

1  in  1 

An  alternative  way  of  classifying  the  chiefdoms  would 
be   to  have  four  descriptive  groups,    such  as   small,   medium, 
large,    and  very  large.      The  gains  from  this  type  of   stra- 
tification should  be  nearly  equal  to  those  gains  from 
stratification  with  known  population  sizes. 
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4.3  Listing  and  stratifying  holdings 

As  in  the  Provinces,  the  small  holdings  within 
the  sample  PSU's  in  the  Northern  Territory  will  he 
listed  and  stratified;  however,  the  criteria  for 
stratification  and  the  sampling  procedure  will  he 
different.  See  exhibit  17-5-6  (illustration  of  a 
filled  Listing  Book  for  a  sample  PSU  in  the  North- 
ern Territory)  and  exhibit  IV-5-5  (sketch  of  the 
location  of  the  housing  units  in  the  PSU) . 

Special  care  must  be  taken  in  the  listing  of 
holdings  within  sample  PSU's  in  the  Territory. 
Chief dom  boundaries  are  not  well  defined  and  hous- 
ing units  which  appear  to  be  just  outside  the 
boundaries  might  in  fact  belong  to  the  chief dom. 
In  addition,  some  social  customs  of  the  Territory 
complicate  the  identification  of  holdings .  Nomads 
may  also  present  difficulties.  They  will  be  sam- 
pled where  their  housing  units  are  located  at  the 
time  of  enumeration. 

One  major  difference  in  listing  in  the  Ter- 
ritory as  compared  with  the  Provinces  is  reflected 
on  the  listing  sheet  (exhibit  IV-5-6)  for  the 
Territory.  The  size  categories  which  were  used 
to  stratify  the  holdings  in  the  Provinces  have 
been  omitted  from  the  listing  sheet  of  the  Terri- 
tory because  the  holders  are  not  expected  to  be 
able  to  provide  information  or  size.  Instead,  in- 
formation is  obtained  on  number  of  livestock — a 
figure  which  the  respondents  can  usually  provide . 
Numbers  of  cattle,  sheep  and  goats  are  recorded  in 
columns  8  and  9,  the  holdings  are  stratified  by 
whether  or  not  they  have  more  than  specified  num- 
bers of  livestock,  and  the  sample  holdings  are 
selected  separately  from  these  categories . 

The  definition  of  holding  and  the  selection 
process  are  also  slightly  different  in  the  North- 
ern Territory  where  there  is  no  minimum  size  of 
holding  and  there  are  no  Local  Certainty  holdings . 

4.4  Selecting  the  holdings  within  sample  PSU's 

The  sample  of  holdings  in  the  Territory  will 
be  selected  by  the  enumerator  rather  than  by  the 
supervisor  (see  section  5  below) .  The  enumerator 


will  visit  all  housing  units  in  a  PSU,  list  the 
holdings  associated  with  each  housing  unit,  record 
the  number  of  cattle,  sheep  and  goats  on  each  hold- 
ing in  columns  8  and  9  of  the  listing  sheet,  and 
separate  the  holdings  with  large  numbers  of  live- 
stock from  those  with  small  numbers  by  filling 
columns  11  and  12. 

To  select  the  sample,  the  enumerator  will  be 
provided  with  two  numbers --a  "start  with"  and  a 
"take  every"  number,  shown  for  "PSU  sampling"  in 
item  (a)  on  the  cover  of  the  Listing  Book  (see 
exhibit  IV-5-6) .  Both  of  these  numbers  might  vary 
from  PSU  to  PSU.  The  enumerator  will  count  the 
holdings  with  large  numbers  of  livestock  ("X"  in 
column  11)  until  he  reaches  his  "start  with"  num- 
ber, which  will  be  the  first  sample  holding.  Then 
the  sample  interval,  or  "take  every"  number,  is 
added  repeatedly  to  identify  succeeding  sample 
holdings  of  this  type  shown  in  column  11.  When 
the  end  of  this  category  is  reached,  the  sampling 
continues  to  the  other  category  ("X"  in  column  12) 
until  that  category  has  been  sampled. 

4.5   Selecting  the  sub  sample  for  area 
measurement 

After  selecting  the  holdings,  the  enumerator 
will  select  the  subsample  of  holdings  for  area 
measurement .  The  original  sample  holdings  will 
be  shown  by  the  questionnaire  numbers  assigned  to 
them  in  columns  14-  and  15  of  the  Listing  Book. 
The  subsample  of  holdings  is  selected  by  taking 
one  of  the  first  four  sample  holdings  randomly 
and  then  selecting  every  fourth  sample  holding 
thereafter.  Instructions  for  making  the  selection 
will  be  given  for  "area  measurement  sampling"  in 
item  (a)  on  the  cover  of  the  Listing  Book  for  each 
PSU. 

5.    ENUMERATION  PLAN 

As  was  indicated  above,  conditions  in  the 
Territory  are  such  that  the  enumeration  plan 
used  for  the  Provinces  requires  substantial  modi- 
fication for  use  in  the  Northern  Territory. 
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5.1   Interviewing  and  supervision 

There  will  be  8  or  9  supervisors,  each  in 
charge  of  4  or  5  enumerators .  Due  to  the  unfavor- 
able travel  conditions,  the  supervisor  will  be  un- 
able to  plan  the  time  of  his  visits  to  the  enumer- 
ator so  that  he  can  select  the  sample  as  in  the 
Provinces .  There  could  be  serious  delays  in  com- 
pleting the  field  work  if  the  enumerators  were 
required  to  wait  until  the  supervisor  was  avail- 
able to  select  the  sample  holdings .  The  supervisor 
will  review  the  sample  selection  in  every  PSU, 
however,  and  will  reselect  the  sample  in  those 
cases  where  the  proper  sampling  procedures  were 
not  followed. 

The  enumerator  will  be  assigned  a  single  PSU 
( chief dom  or  Special  EA)  at  a  time.  He  will  visit 
nis  sample  area,  list  the  holdings,  and  select  the 
sample  holdings,  using  the  sampling  instructions 
for  his  PSU.  The  supervisor  will  visit  the  enu- 
merator as  often  as  possible  but,  because  of 
travel  difficulties,  he  may  not  always  be  able  to 
follow  a  predetermined  schedule.  Since  PSU's  will 
not  be  made  of  equal  size  as  in  the  Provinces,  the 
time  needed  for  listing  will  vary  greatly  for  the 
PSU's  in  the  Northern  Territory.  The  sample  design 
tends  to  equalize  the  interviewing  workload  but 
not  the  listing  workload. 

The  National  Certainty  holdings  will  be  inter- 
viewed generally  by  the  enumerator  closest  to  the 
holding.  Because  of  poor  communication  systems, 
the  questionnaires  will  not  be  mailed  to  the  hold- 
ers as  in  the  Provinces.  For  the  area  of  cropland, 
the  information  will  be  obtained  by  a  direct  ques- 
tion, since  it  is  assumed  that  holders  of  large 
tracts  of  land  would  be  able  to  supply  such  infor- 
mation (objective  measurement  will  not  be  applied 
to  the  National  Certainty  holdings). 

Since  the  enumeration  plan  requires  the  enu- 
merator to  work  independently,  a  careful  review  of 
the  enumerator's  work  by  the  supervisor  is  most  im- 
portant .   If  the  review  indicates  that  the  enumera- 
tor is  doing  good  work,  a  second  PSU  or  several  Na- 
tional Certainty  holdings  may  be  assigned  to  him. 


5.2   Objective  measurement  of  area 

Objective  measurement  of  the  crop  area  in  the 
holding  will  be  carried  out  at  a  randomly  selected 
one -fourth  of  the  sample  small  holdings  in  the 
Northern  Territory.   In  order  that  the  measure- 
ments can  be  done  quickly  and  with  reasonable 
accuracy,  a  simple  method  of  making  the  measure- 
ments is  needed. 

5.21  Making  the  sketch  of  cropland  in  the 
holding . — The  greatest  difficulty  in  measuring  an 
area  objectively  is  in  preparing  an  accurate  sketch 
of  the  land.  After  a  sketch  is  made,  the  number 
of  hectares  (or  other  unit)  can  be  determined  in 
several  ways . 

In  the  Territory,  the  holding  will  often  be 
divided  into  many  plots  of  irregular  shapes .  Each 
of  the  plots  must  be  sketched  separately  and  the 
crop  areas  clearly  outlined  and  labelled  to  permit 
identification  of  the  five  important  crops  sepa- 
rately (wheat,  rice,  cotton,  coffee,  and  sisal). 
The  sketch  will  be  made  on  the  graph  portion  of 
page  4  of  the  Agriculture  Questionnaire  for  the 
Northern  Territory.  The  graph  is  divided  into 
equal-sized  squares --representing  20  meters  on 
each  side;  each  square  contains  16  dots  to  guide 
in  determining  fractional  parts  of  a  square.  The 
only  equipment  needed  will  be  a  reasonably  accurate 
compass,  measuring  chain,  and  a  dozen  or  more 
marking  poles.  The  poles  should  be  red  or  some 
other  highly  visible  color,  although  some  of  the 
enumerators  will  find  it  necessary  to  cut  poles 
from  nearby  bush. 

The  enumerator  will  begin  the  measurement  at 
one  corner  of  the  sample  holding.  Using  the  com- 
pass, chain,  and  measuring  poles,  he  will  fit  a 
rectangle  to  include  as  much  of  the  crop  area  as 
possible  and  sketch  the  rectangle  onto  the  paper. 
With  the  sides  of  the  rectangle  as  a  base  line,  he 
will  sketch  the  boundaries  of  the  area  by  measuring 
the  distance  between  the  base  line  and  the  boundary 
at  intervals  of  20  meters  and  marking  the  measured 
points  on  the  sketch.  The  points,  so  marked  off  on 
the  sketch  will  be  joined  with  straight  lines  which 
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will  approximate  the  area  boundaries .  The  Enumer- 
ator 's  Reference  Manual  (Northern  Territory),  Form 
A- 51,  explains  the  area  measurement  technique  in 
greater  detail. 

When  the  crop  area  consists  of  separate  plots, 
the  enumerator  should  measure  and  sketch  each  plot 
separately. 

5.22  Determining  the  area. — The  actual  com- 
putation of  the  area  will  be  done  by  the  staff  in 
the  Central  Office  of  the  NSO.  The  number  of  com- 
plete squares  within  the  boundaries  will  be  counted 
and  multiplied  by  the  known  area  represented  by 
each  square.  For  squares  which  are  only  partially 
inside  the  boundaries,  the  area  will  be  determined 
by  estimating  the  proportion  of  dots  inside  the 
boundaries,  multiplying  the  proportion  by  the 
known  area  of  the  square,  summing  over  all  partial 
squares  and  adding  it  to  the  area  included  in  whole 
squares .  The  procedure  will  be  followed  for  all 
plots  in  the  holding.  Finally,  the  area  of  the 
plots  will  be  added  together  for  the  total  crop 
area  of  the  holding. 

If  the  squares  of  the  rectangle  are  of  a  suit- 
able size,  the  method  could  be  used  to  make  esti- 
mates of  yield.  A  sample  of  the  squares  from  each 
crop  could  be  selected  and  staked  out  early  in  the 
growing  season  for  measurement  at  harvest  time. 
Yield  estimates  could  then  be  made  for  the  Terri- 
tory or  for  the  zones,  but  not  for  individual 
holdings .  Estimation  of  yields  is  not  recommended 
as  a  census  activity,  however,  due  to  the  heavy 
costs  of  making  several  visits  to  the  plot  during 
the  year.  Data  on  yields  should  preferably  be 
collected  in  the  annual  agriculture  statistics 
program . 

6.    PREPARATION  OF  ESTIMATES 

Estimates  in  the  Northern  Territory  will  be 
made—after  adjusting  for  non -interviews — by 
weighting  the  sample  results  for  the  small  hold- 
ings by  48  and  adding  the  results  for  the  National 
Certainty  holdings.  The  area  in  cropland,  which 
will  be  measured  for  a  fourth  of  the  small  hold- 
ings, will  be  obtained  by  weighting  the  appropriate 


entries  from  the  subsample  by  192  and  adding  the 
corresponding  entries  for  the  National  Certainty 
holdings.  The  estimation  method  is  described  in 
chapter  6  of  this  unit.  The  first-stage  ratio 
adjustment  described  in  that  chapter  may  also  be 
used  in  the  Northern  Territory. 

Census  objectives  for  characteristics  invol- 
ving area  will  be  quite  limited.  The  distribution 
of  the  holdings  by  size  of  crop  area  can  be  ob- 
tained for  each  of  the  zones.  Any  cross -classi- 
fication of  size  data  with  tenure  or  other  char- 
acteristics should  be  avoided  at  the  zone  level. 
However,  such  data  should  be  of  sufficient  relia- 
bility for  the  Territory  as  a  whole  if  the  num- 
ber of  cross -classifications  is  held  to  a  mini- 
mum. 

Variance  estimates  can  be  made  using  the 
procedures  described  in  chapter  7  of  this  unit. 

7.    SUMMARY 

Estimates  for  each  of  the  three  zones  in  the 
Northern  Territory  are  required.  The  sample  for 
the  Territory  was  designed  so  that  most  estimates 
could  be  made  with  no  more  than  about  a  10%  co- 
efficient of  variation.  The  sample  holdings  will 
be  selected  from  sample  PSU's  ( chief doms  and 
Special  EA's).  The  PSU's  will  be  selected  with 
different  probabilities  depending  on  their  popu- 
lation. Holdings  will  also  be  selected  at  vari- 
able rates  so  that  the  overall  sampling  rate  is 
the  same  for  every  holding  except  the  National 
Certainty  holdings.  A  total  of  about  1,500  small 
holdings  and  about  100  National  Certainty  holdings 
will  be  selected  in  the  sample .  There  will  be  no 
Local  Certainty  holdings . 

Objective  measurement  of  the  area  of  crop- 
land in  the  holding  will  be  carried  out  for  a 
subsample  (one-fourth)  of  the  small  holdings. 
Area  in  cropland  will  be  reported  for  all  the 
National  Certainty  holdings,  but  will  not  be  meas- 
ured objectively. 

Exhibit  IV- 5-4  summarizes  the  sample  size  for 
the  Northern  Territory. 
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1.    INTRODUCTION 

When  the  enumeration  and  area  measurements 
have  been  completed,  and  all  the  questionnaires 
checked  in  and  edited,  there  will  be  some  sample 
holdings  for  which  no  usable  questionnaires  were 
obtained.  This  chapter  describes  a  method  of  ad- 
justing the  sample  for  such  "noninterviews "  and 
for  questionnaires  with  missing  area  measurements. 
The  noninterview  adjustment  does  not  apply  to  in- 
dividual missing  or  inconsistent  entries  in  the 
questionnaire,  which  in  some  cases  are  corrected 
by  editing  or  imputation  (supplying  the  data  ac- 
cording to  prescribed  rules)  and  in  other  cases 
are  allowed  to  appear  in  a  "Not  reported"  category. 

This  chapter  also  describes  a  simple  method  of 
expanding  the  adjusted  sample  results  to  obtain 
estimates  for  the  entire  universe  of  holdings. 
Since  three  probabilities  of  selection  were  used 
in  the  Provinces,  there  are  three  expansion  fac- 
tors (or  weights)  to  be  used  in  making  estimates 
for  the  districts  and  the  Provinces.  Similarly, 
there  are  three  weights  for  the  Northern  Territory. 

2.    NONINTER  VIE  W  ADJUSTMENT 

In  an  agriculture  census,  it  is  practically 
impossible  to  obtain  interviews  from  every  hold- 
ing which  is  specified  for  enumeration.   Unless 
corrected  for,  the  noninterviewed  cases  could 
cause  a  serious  bias,  especially  in  estimates  of 
totals  and  aggregates.  If  the  number  of  all  such 
cases  is  not  more  than  about  5%   (10/6  at  the  very- 
most),  a  simple  noninterview  adjustment  may  give 
satisfactory  results. 

2.1    Types  of  noninterviews 

A  holding  may  be  a  "noninterview"  for  a  num- 
ber of  reasons.  The  noninterview  adjustment  is 
limited  to  those  holdings  for  which  Agriculture 


Questionnaires  would  have  been  filled  if  a  com- 
pleted interview  had  been  obtained.  These  are 
cases  which  are  reported  as: 

(1)  No  one  at  home  at  time  of  call 

(2)  Temporarily  absent  for  a  period  of  time 

(3)  Refusal 

(4.)  No  qualified  respondents  available. 

There  may  also  be  a  few  cases  where  the  question- 
naire is  lost,  destroyed,  or  so  poorly  filled  in 
that  it  is  not  usable.  These  are  also  treated  as 
noninterviews  if  it  is  determined  that  a  return 
visit  would  be  impractical. 

2.2  Adjustment  for  noninterviewed  small 
holdings 

To  adjust  for  noninterviews  of  sample  small 
holdings,  data  for  an  interviewed  small  holding  in 
the  same  PSU  and  preferably  in  the  same  size-of- 
holding  group  (as  shown  in  the  Listing  Book)  will  be 
substituted  for  the  noninterviewed  holding.  Suppose, 
for  example,  the  fourth  sampled  holding  shown  in  ex- 
hibit IV-4-1  is  reported  as  a  noninterview  because 
there  was  no  one  at  home .  We  select  the  next  sample 
holding  in  the  same  size  category  (the  13""1  sample 
holding  in  exhibit  IV-4-1)  and  duplicate  it,  either 
by  making  up  an  identical  questionnaire  or  by  dupli- 
cating the  corresponding  cards  for  that  holding. 
This  has  the  effect  of  doubling  the  weight  for  the 
holding  which  is  duplicated.   If  a  computer  is  used, 
the  duplication  of  questionnaires  or  cards  is  unnec- 
essary; the  procedure  will  be  to  simply  double  the 
weight  of  the  interviewed  questionnaire.   If  there 
are  two  noninterviews  in  the  same  PSU,  a  second  hold- 
ing (other  than  the  one  already  duplicated)  is  selec- 
ted in  a  similar  way,  and  this  holding  is  duplicated. 

A  slight  variation  in  this  procedure  for 
small  holdings  is  necessary  in  the  Northern  Ter- 
ritory, where  every  fourth  sample  small  holding 
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is  specified  for  area  measurements .  To  keep  the 
area  measurement  sample  self -weighted,  a  noninter- 
view  among  the  holdings  specified  for  area  measure- 
ments will  require  duplication  of  another  holding 
having  area  measurements .  Similarly,  for  holdings 
not  requiring  area  measurements,  interviewed  hold- 
ings without  area  measurements  should  be  substi- 
tuted for  noninterviewed  cases . 

2.3  Adjustment  for  noninterviewed  certainty 
holdings 

As  described  earlier,  a  greater  effort  will 
be  made  to  obtain  census  data  from  Local  Certainty- 
holdings,  and  especially  from  National  Certainty 
holdings.  No  doubt  there  will  be  some  noninter- 
views  remaining  in  both  categories. 

For  Local  Certainty  holdings  the  noninterview 
adjustment  will  be  made  in  the  same  way  as  for  small 
holdings --by  duplicating  another  Local  Certainty 
holding  in  the  same  size  group.  Where  possible  the 
substitute  holding  will  be  in  the  same  PSU,  but  it 
may  be  necessary  to  use  an  interviewed  Local  Cer- 
tainty holding  from  another  PSU  in  the  same  district. 
In  either  case,  the  selection  should  be  made  accord- 
ing to  a  predetermined  procedure,  such  ej  taking  the 
next  numbered  Local  Certainty  holding  in  the  same 
PSU  or,  where  necessary,  in  the  next  numbered  PSU. 

For  National  Certainty  holdings,  we  can  ex- 
pect that  there  will  be  very  few  interviews  avail- 
able for  duplication  in  the  same  PSU  or  even 
within  the  same  district.  The  interviewed  Nation- 
al Certainty  holdings  may  also  be  very  different 
from  those  missed.  Therefore,  duplication  will 
usually  not  be  feasible.  Instead,  data  for  the 
questionnaire  will  be  imputed  on  the  basis  of  the 
best  available  data. 

3.   ADJUSTMENT  FOR  MISSING  AREA 
MEASUREMENTS 

In  some  cases  in  the  Northern  Territory,  the 
interview  portion  of  a  questionnaire  will  be  usable 
but  the  area  measurement  information  will  be  missxng 
or  not  usable.  These  will  be  treated  the  same  as 
other  cases  of  individual  missing  or  not  usable 


entries.  The  area  measurement  data  will  be  im- 
puted on  the  basis  of  the  best  information  avail- 
able, and  the  imputed  entries  should  be  consis- 
tent with  production  data  for  the  corresponding 
crops  in  Section  IV  of  the  same  questionnaire. 

4.    IDENTIFICATION  OF  DUPLICATE  CARDS 

Duplicate  cards  will  be  punched,  or  double 
weights  assigned,  primarily  for  noninterviews. 
Another  possible  reason  is  to  compensate  for  un- 
usual subsampling  rates  used  for  small  holdings 
in  PSU's  having  a  very  large  number  of  small  hold- 
ings. This  would  require  duplicating,  or  at  most 
triplicating,  all  the  small  holdings  in  a  PSU, 
since  subsampling  rates  in  those  cases  are  either 
1/2  or  1/3  of  the  usual  rate.   (See  section  4.3 
in  chapter  4  of  Unit  IV.)  Replication  eliminates 
the  need  for  additional  weight  groups  in  these 
cases. 

When  a  card  is  duplicated  (or  a  weight  is 
doubled) ,  a  code  should  be  punched  in  the  dupli- 
cate card  to  show  that  it  is  a  duplicate  and  to 
indicate  the  reason  for  the  duplication.   Such  a 
code  will  enable  the  NSO  staff  to  analyze  the 
effect  of  the  duplications  on  the  census  results. 

Sometimes  a  different  color  is  used  for 
duplicated  cards  (or  cards  with  a  doubled  weight) 
for  easy  visual  identification  in  the  deck.  A 
code  punched  in  the  card  is  still  necessary,  how- 
ever, in  order  to  sort  the  duplicated  cards  from 
the  main  deck. 

5.    ALTERNATIVE  ADJUSTMENT  PROCEDURES 

The  procedure  outlined  for  handling  nonin- 
terviews was  considered  to  be  the  most  practical 
for  Agrostan,  and  the  simplest  to  carry  out. 
Another  method  of  substitution  is  to  select  the 
holding  to  be  duplicated  at  random,  from  those 
interviewed  in  the  same  size  group  in  the  PSU. 
Any  reasonable  method  of  substitution  may  be  used 
if  it  is  predetermined  and  the  number  of  cases  is 
not  very  large.  If  the  noninterview  rate  is  large, 
no  method  of  substitution  can  be  very  satisfactory. 
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A  second  alternative  is  to  increase  the  weight 
of  all  the  "interviewed"  questionnaires  in  the  PSU 
where  the  noninterview  occurs;  the  weight  would  be 
increased  to  the  extent  needed  to  offset  the  non- 
interviews  in  the  PSU.  In  effect,  this  procedure 
substitutes  average  characteristics  for  the  miss- 
ing data,  which  is  theoretically  more  desirable 
than  multiplying  some  cards  in  the  PSU  by  2  and 
not  multiplying  others.   On  the  other  hand,  the 
procedure  of  changing  the  weight  would  result  in 
many  different  numbers  for  the  weights,  and  would 
complicate  the  tabulations;  it  is  usually  not  fea- 
sible unless  a  computer  is  available  for  process- 
ing the  data. 

6.    POSSIBLE  BIASES  IN  ADJUSTMENT 
PROCEDURES 

Duplication  of  questionnaires  or  cards  to 
adjust  for  noninterviews  will  lead  to  a  sample 
set  of  questionnaires  or  cards  for  the  same  num- 
ber of  holdings  as  if  usable  interviews  were  ob- 
tained from  all  sample  holdings .  However,  some 
serious  problems  may  still  arise  because  of  the 
noninterviews . 

(1)  The  noninterviews  may  be  concentrated  in  cer- 
tain subgroups  of  the  population,  such  as 
holdings  belonging  to  holders  who  are  either 
very  young  or  elderly.  For  such  a  group,  we 
may  be  representing  these  holdings  by  others 
which  have  rather  different  characteristics. 

(2)  If  the  noninterview  rate  (including  lost 
questionnaires)  becomes  more  than  about  5%> 
efforts  should  be  made  to  reduce  it  by  re- 
training enumerators  and  requiring  more  call- 
backs. 

7.    ESTIMATES  FROM  THE  SAMPLE 

In  planning  for  a  census  or  survey,  table 
formats  are  prepared  in  advance  showing  the  kinds 
of  estimates  to  be  made.   (Tables  for  most  of  the 
estimates  which  will  be  made  for  Agrostan  have 
been  prepared  and  are  given  in  the  appendix  of 
Unit  III.)  After  the  cards  have  been  edited, 
coded,  and  duplicated  for  noninterviews,  they  are 
tabulated  and  weighted  to  provide  the  data  for 
the  cells  shown  in  the  table  formats .  From  the 
table  formats  for  Agrostan,  it  can  be  seen  that 


the  required  estimates  are  of  the  following  types : 

(1)  Estimates  of  numbers  of  holdings  and  of 
aggregates 

(2)  Estimates  of  ratios,  percentages,  and  aver- 
ages; and  estimates  of  proportions  of  hold- 
ings having  an  attribute . 

7.1   Estimates  of  numbers  of  holdings  and  of 
aggregates 

In  the  Provinces,  estimates  of  totals  or 
aggregates  (such  as  total  number  of  holdings  for  a 
district  or  total  production  of  maize)  will  be  ob- 
tained as  follows :   If  cards  are  duplicated  for 
noninterviews,  the  sample  totals  for  Local  Cer- 
tainty holdings  will  be  multiplied  by  4  and  the 
sample  totals  for  small  holdings  will  be  multiplied 
by  16.   If  weights  of  individual  holdings  were 
adjusted  for  noninterviews,  there  would  be  addi- 
tional weight  groups .  The  weighted  totals  for 
the  two  groups  of  holdings  (the  small  and  the  Lo- 
cal Certainty  holdings)  will  be  combined  and  added 
to  the  total  for  the  National  Certainty  holdings 
located  in  the  district.  The  resulting  estimates 
of  district  totals  will  be  added  to  obtain  esti- 
mates for  the  Provinces  as  a  whole. 

Similar  methods  will  be  used  to  obtain  esti- 
mated totals  for  the  zones  in  the  Northern  Ter- 
ritory. However,  the  weight  for  small  holdings 
will  be  4-8.  The  totals  for  the  National  Certainty 
holdings  will  be  added  to  the  weighted  totals  for 
the  small  holdings  to  obtain  estimated  totals  for 
each  zone.   (There  will  be  no  Local  Certainty  hold- 
ings in  the  Northern  Territory.)  The  subsample 
of  holdings  for  which  area  measurements  are  made 
will  have  a  special  weight  for  these  measurements, 
which  is  4  x  48  or  192. 

If  the  tabulation  is  done  on  an  electronic 
computer  the  weighting  can  be  accomplished  as 
part  of  the  overall  tabulation  process.  On  con- 
ventional tabulating  equipment  the  National  Cer- 
tainty holdings,  the  Local  Certainty  holdings, 
and  the  small  holdings  would  all  be  tabulated  and 
weighted  separately  for  each  cell  in  the  tables, 
and  the  weighted  totals  for  the  different  types  of 
holdings  would  be  added  together,  cell  by  cell. 
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The  weighted  sum  for  each  cell  is  an  estimate  of 
the  characteristic  which  the  cell  represents. 

The  estimate  in  each  cell  is  subject  to  sam- 
pling error  which  will  vary  with  the  nature  of  the 
characteristic  and  the  number  of  holdings  repre- 
sented by  the  cell.  For  example,  a  district  es- 
timate of  a  total  or  an  aggregate  will  usually 
have  a  smaller  relative  error  if  the  characteristic 
is  associated  with  5,000  holdings  in  the  district 
than  if  it  is  associated  with  only  500  holdings. 
Methods  of  computing  variances  are  discussed  in 
chapter  7  of  this  unit. 

The  computation  of  variances  will  also  in- 
volve a  tabulation  and  weighting  process.   It  is 
possible  that  the  tabulation  and  weighting  for 
variance  computations  can  be  done  at  the  same  time 
that  the  tabulation  for  making  estimates  is  car- 
ried out.  This  will  depend  on  the  method  used  to 
compute  variances  and  the  type  of  tabulation  equip- 
ment used;  the  sampling  technician  should  take  an 
active  part  in  the  planning  for  tabulation  of  the 
data,  in  order  to  be  able  to  recommend  a  reasonable 
and  economical  method  for  computing  variances. 
Tabulation  processes  for  Agrostan  are  discussed  in 
Unit  VIII. 

7,2  Estimates  of  ratios,  percentages, 
averages  and  proportions 

To  estimate  ratios,  percentages,  averages, 
and  proportions,  the  procedure  is  to  make  esti- 
mates from  the  sample  for  the  numerator  and  de- 
nominator and  divide  the  estimates  to  obtain  the 
required  statistics.  All  are  ratio-type  estimates. 
For  example,  the  percentage  of  tobacco  holdings  on 
which  chemical  fertilizer  is  used  would  be  obtained 
as  follows: 

estimated  number  of  tobacco  holdings 

using  chemical  fertilizer 

estimated  total  number  of  tobacco        ' 
holdings 

where  the  estimated  numbers  in  the  numerator  and 
denominator  are  obtained  by  weighting  the  sample 
results.  In  a  few  instances,  the  ratio  of  sample 
data  may  be  used;  for  example,  if  the  character- 
istic being  estimated  is  associated  only  with  a 


single  class  of  holdings  (say  the  small  holdings 
in  the  Northern  Territory  where  only  one  weight  is 
involved),  the  use  of  unweighted  sample  results 
will  provide  the  same  estimate . 

The  reliability  of  an  estimate  of  a  ratio 
(percentage,  average,  or  proportion)  will  depend 
on  the  correlation  between  the  numerator  and  the 
denominator.  If  the  two  items  have  a  high  posi- 
tive correlation,  then  the  resulting  ratio  will 
usually  have  a  much  smaller  relative  error  than 
either  item  individually. 

7.3  Ratio  adjustment  of  estimates  of  totals 
and  aggregates 

An  estimated  total  or  aggregate  of  the  type 
described  in  section  7.1  is  often  called  an  "un- 
biased estimate"  because  it  can  be  proved  that 
its  expected  value  is  the  same  as  the  result  that 
would  have  been  obtained  in  a  complete  enumeration. 
Often,  if  certain  data  are  available,  it  is  pos- 
sible to  make  a  ratio  estimate  which  has  a 
smaller  sampling  error,  and  which  is  therefore 
preferred  even  though  it  has  a  small  (usually 
negligible)  bias. 

The  ratio  estimate  can  be  obtained  by  ap- 
plying an  adjustment  to  the  unbiased  estimate. 
It  requires  information  about  both  the  sample 
and  the  universe  in  addition  to  that  used  in  the 
unbiased  estimate.   Under  the  conditions  of  the 
Agrostan  sample,  it  is  not  expected  that  this 
adjustment  will  have  a  significant  effect.  It 
might  provide  an  important  improvement  in  dis- 
trict estimates  under  slightly  different  condi- 
tions —  especially  if  it  were  not  possible  to 
nearly  equalize  the  PSU' s  in  terms  of  population 
or  numbers  of  holdings.   In  that  case,  a  ratio 
adjustment  could  be  used  to  adjust  for  the  dif- 
ference between  the  average  population  of  the 
selected  PSU's  in  a  district  and  the  known  aver- 
age population  of  all  PSU's  in  the  district. 


Note  that  this  does  not  refer  to  an  estimate  of  a  ratio, 
but  rather  the  use  of  a  ratio  to  improve  the  estimate  of 
another  characteristic. 


42 


AGROSTAN:  A  Case  Study  for  the  1970  World  Census  of  Agriculture 


Unit  IV 


The  adjustment  assumes  that  if  the  l-in-4 
sample  of  PSU's  has  less  or  more  than  one-fourth 
of  the  true  population  of  the  district,  there  will 
be  a  proportionate  shortage  or  excess  in  the  sam- 
ple numbers  of  holdings  and  in  aggregates-  related 
to  holdings .  The  principal  textbooks  on  sampling 
applied  to  surveys  contain  discussions  of  ratio 
estimating  methods.   (See  also  section  4.7  of 
chapter  8  for  more  detail  on  method  of  computing 
ratio  estimates . ) 

8.    USE  OF  SAMPLING  ERRORS  WITH  THE 
ESTIMATES 

In  addition  to  publishing  the  estimates  from 
the  survey,  information  should  be  presented  along 


with  these  estimates  to  permit  users  to  evaluate 
their  accuracy.  Usually  it  is  not  feasible  to  pro- 
vide data  on  sampling  for  all  items  included  in  the 
survey.  However,  estimates  of  sampling  errors  for 
some  typical  or  important  items  should  be  computed 
and  included  in  the  Census  publications .  The  items 
selected  and  the  method  of  computing  sampling  errors 
should  be  such  as  to  provide  as  much  useful  infor- 
mation as  is  feasible.  Several  methods  of  computing 
sampling  errors  are  presented  in  chapter  7. 

Non-sampling  errors  and  biases  require  exten- 
sive and  costly  research,  just  to  obtain  some  very 
rough  measures  of  their  importance.  Therefore 
no  effort  is  being  made  to  include  anything  about 
them  in  the  basic  Census  publications . 


Chapter  IV  -  7.    ESTIMATES  OF  SAMPLING  ERROR 


1.    INTRODUCTION 

The  tabulation  program  will  provide  thousands 
of  estimates  for  holdings,  crops,  livestock,  etc., 
for  Agrostan.  Since  the  Census  is  based  on  a  sam- 
ple there  will  be  a  sampling  error  associated  with 
each  item.  When  the  size  of  the  sample  required 
for  the  census  was  being  determined  a  number  of 
speculations  were  made  about  the  size  of  the 
sampling  error.   After  the  survey  is  completed, 
more  precise  estimates  of  the  sampling  error  can 
be  produced. 

The  calculation  of  sampling  errors  is  costly 
and  time-consuming;  hence  it  will  be  necessary  to 
limit  the  work  to  a  small  number  of  items .  It  is 
important  that  the  method  used,  and  the  items  for 
which  variances  are  computed,  be  chosen  so  as  to 
give  results  for  the  most  important  items,  and  at 
a  reasonable  cost.  This  chapter  will  discuss 
methods  of  computing  sampling  error. 

2.    VARIANCE  OF  A  SAMPLE  ESTIMATOR 

The  estimate  from  the  sample  for  any  cell  of  a 
Census  table  is  only  one  of  many  possible  results 
that  might  have  been  obtained.   A  different  sample 
of  holdings  would  give  a  different  estimate  with 
the  same  estimating  formula,  or  "estimator."  If  es- 
timates using  a  particular  estimator  are  made  from 
all  possible  samples  which  could  be  selected,  one 
would  obtain  many  different  estimates.  The  distri- 
bution of  these  estimates  would  have  a  mean  and  a 
variance.   The  variance  of  this  group  of  estimates 
is  called  the  variance  of  the  sample  estimator  (or 
the  variance  of  the  sample  estimate) .  Knowing  the 
variance,  one  can  construct  a  confidence  interval 
for  the  estimator  (or  for  a  particular  estimate); 
and  a  statement  can  be  made  on  the  probability 
that  the  true  value  of  the  characteristic  lies 
within  the  interval. 


Obviously,  one  cannot  select  all  possible 
samples  of  holdings  in  Agrostan.  However,  an 
estimate  of  the  variance  can  be  made  from  the 
observations  in  a  single  sample,  if  a  probability 
sample  is  used.   In  the  following  discussion,  the 
theory  is  developed  in  terms  of  all  possible  sam- 
ples and  known  population  totals.   Then  some 
methods  used  to  estimate  variances  from  a  single 
sample  are  shown  (see  section  3) • 

2.1    Variance  of  an  estimate  from  a  simple 
random  sample 

If  there  are  N  holdings  in  a  district  of 
Agrostan  and  one  of  them  (the  i  )  is  selected  at 
random  in  the  sample,  an  estimate  (x1 )  of  the 
district  total  (X)  of  an  item  can  be  made  by  mul- 
tiplying the  value  observed  for  that  holding, 
(x. ) ,  by  N.   Thus,  the  estimator  is  x'  =  Nx.  .   If 
the  sample  of  one  holding  showed  5  hectares  of 
cultivated  land  and  there  were  64.OO  holdings  in 
the  district,  the  result  of  substituting  these 
values  in  the  estimator  is  to  obtain  an  estimate 
of  the  total  area  of  cultivated  land  in  the  dis- 
trict, namely  32,000  hectares.   This  is  an  un- 
biased estimate  of  the  actual  area  of  cultivated 
land  in  the  district. 

2.11  Definition  of  the  population  variance. — 

The  variance  of  this  estimate  is  called  o*2 ,  >  an^i 

is  defined  as: 

N 

(7.1)   oj,  =iZ(x:-x)2=il(Nxrx)3 

i 
N  N 

=  N  N  X(xf*)2  =N2i£(X.-X)a 


=  rro-  , 
x 

u 

where  X  =2_X.  is  the  "population  total,"  X  =  t: 

is  the  "population  mean,"  and  a2  is  the  "population 
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variance"  for  the  item.   By  definition, 


Y_  (Xi-x)a 


X  N 

which  is  the  squared  standard  deviation  of  all 
units  in  the  population,  measured  from  the  popula- 
tion mean. 

2.12  Variance  for  a  sample  of  n  holdings . — 
With  a  simple  random  sample  of  n  holdings  the 
estimate  of  the  total  is 

n 

N 


-y *■ 

n  Z_   i 


i=l 


and  its  variance  is 


o3 


(The  symbol  -  means  "is  approximately  equal  to.") 
The  latter  form  of  the  formula  is  generally  used 
when  N  is  greater  than  about  20. 

When  the  sampling  fraction  —  is  small  (less 
than  .05)  the  "finite  correction  factor"  -* — jj — '■ 
can  be  ignored;  for  example,  with  a  sampling  frac- 


tion of  1  in  100,  the  factor  becomes 


_22. 

100 


Since 


this  is  nearly  equal  to  1,  a  simplified  formula 
can  be  used: 


(7.3) 


a*,  =  N2  — 
x'      n 


2.2    Variance  for  the  Agrostan  sample  design 

The  variance  of  an  estimator  from  a  simple 
random  sample  was  discussed  in  paragraph  2.1,  in 
order  to  give  the  reader  an  understanding  of  the 
relationship  between  the  "population  variance"  and 
the  variance  of  a  sample  estimator. 

For  the  Agrostan  Census  of  Agriculture  much 
of  the  sample  was  selected  in  two  stages;  it  is 
not  a  simple  random  sample.   PSU's  were  selected 
at  the  first  stage,  and  holdings  were  selected  at 
the  second  stage.   Each  stage  of  sampling  contri- 
butes to  the  total  variance  of  each  estimate; 
there  is  a  between-PSU  component  of  the  variance, 
and  an  average  within-PSU  component. 


The  estimator  of  a  district  aggregate  for  a 
characteristic  "X"  for  Agrostan  is  computed  after 
the  noninterview  adjustment;  it  is  expressed  as: 

x'  -   16  x  (sum  of  "X"  for  all  sample  small  holdings 
in  the  district) 

+  4-  x  (sum  of  "X"  for  all  sample  Local  Certainty 
holdings  in  the  district) 

+  1  x  (sum  of  "X"  for  all  National  Certainty 
holdings  in  the  district) . 

This  estimator  has  a  within-PSU  variance  in 
the  first  term  resulting  from  the  selection  of  the 
small  holdings  within  the  sample  PSU's;  it  has  a 
between-PSU  variance  in  each  of  the  first  two  terms 
resulting  from  the  selection  of  the  sample  PSU's. 
Local  Certainty  holdings  contribute  only  to  the 
between-PSU  variance.   There  is  no  contribution  to 
the  variance  from  the  National  Certainty  holdings 
since  all  of  these  holdings  are  included  in  the 
census. 

2.21  Within-PSU  variance. — Let  us  apply  the 
discussion  of  section  2.1  to  the  within-PSU 
variance  for  the  sample  estimator.  The  formula1 
for  the  within-PSU  variance  for  the  i   PSU  in  the 
sample  is 


(7.4) 


/N.-n.  \o-2(X.) 
°w  i     A      N.      n. 


If  m  PSJ's  are  selected  in  the  sample  with  equal 
probability,  the  formula  for  the  within-PSU 
variance  becomes 


M 


M 


N.-n. 


(X.) 


(7.5)    ^'fllT 


where  az  (X. )  is  the  "population  variance"  within 

J.  l 

the  i       PSU,   and  N.    and  n.    are  the  number  of  units 
i  i 

in  the  total  population  (sampling  frame  of  small 
holdings)   and  in  the   sample,   respectively.     These 
numbers  are  approximately  80  and  20  in  Agrostan. 

Since  the  sampling  fraction     i     is  the  same  for 

Ni 
each  PSU,   Equation   (7.5)   can  also  be  written  as 


Note  that  the  quantities  in  parentheses   following 
are   subscripts,    not  multipliers.      They  are  shown  in  this 
way  to   facilitate  printing. 
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(7.6) 


cr2U< 
w 


N-n  \  w 


mn 


where  N  =  MN,  and  the  population  variance  o2 (X) 

denotes  the  average  variance  of  population  elements 

X. .  around  their  cluster  means  X.  .   Also 
13  1 


(7-7)        <<*>=£  I -fW- 


i=l 


N  -  h 
For  Agrostan  the  term — ; is  approximately 

N 

.75 j  and  should  not  be  ignored  in  computing  the 

variance.  Equation  (7.6)  shows  the  relation  between 

the  within-PSU  variance  of  the  estimator,  g2 (x1 ) , 

W 

and  the  population  variance,  <y2  (X) ,  for  a  sample 

w 

of  m  PSU's  of  average  size  n. 

2.22  Between-PSU  variance .--The  between-PSU 
variance  for  a  sample  of  m  PSU's  (for  a  district, 
a  Province,  or  the  whole  country),  a,  (x'),  is 

D 

related  to  the  population  variance,  o?(X),  by 
Equation  (7.8). 


(7-8)        <U,)=^) 


of  00 


Here  M  is  the  total  number2  of  PSU's,  m  is 
the  number  of  sample  PSU's  and  o?  (X)  is  the 

D 

"population  variance"  between  PSU  totals  (for  the 
same  area)  for  the  characteristic  "X".   In  the 
sample  for  a  district  in  the  Provinces  of  Agrostan 
there  are  two  sources  of  between-PSU  variance;  they 
are  (a)  the  Local  Certainty  holdings  and  (b)  the 
small  holdings .  The  two  groups  of  holdings  would 
be  treated  as  two  separate  strata  and  estimates  of 
variance  would  be  calculated  separately  using  the 
estimated  totals  for  each.  Equation  (7.8)  would 
be  applied  to  each  group  of  holdings  separately 
and  the  results  would  be  added  together  to  obtain 
the  total  between-PSU  variance  for  the  district. 

2.23  Total  variance. — The  total  variance  of 
a  sample  estimator  is  merely  the  sum  of  the 


In  most  districts  the  number  of  PSU1  s  is  greater  than 

M^n  M— m 

20,  so  the  factor  rpr-  has  teen  replaced  "by  "TT"  in  Equation 

(7.8). 


within-PSU  and  between-PSU  variances.     For  an 
estimate  of  a  total    (x' )   we  have 


(7.9) 


=  M2 


/M-m\ 
\    M  / 


<(X) 


+  N' 


N-n 


£(x) 


mn 


It  is  assumed  that  o?(X)  in  the  first  term 


in  the  formula 


M3 


M-m 

M 


a?(X) 


is  the  sum  of  population  variances  obtained  from 
the  small  holdings  and  from  the  Local  Certainty 
holdings.   The  multiplier  in  that  term  is  the  same 
for  both  kinds  of  holdings,  and  can  be  simplified 
to  the  factor  M  ^M~m'  . 


2.3   Relative  variance  and  coefficient  of 
variation 

The  relative  variance  of  any  estimate  is  the 
variance  divided  by  the  square  of  the  estimate. 
For  example, 

4. 


x' 


(x'): 


is  the  relative  variance  of  x' .   The  square  root 
of  the  relative  variance  is  defined  as  the  coeffi- 
cient of  variation  or  "relative  error".   Thus, 


V  .= 


x»   x1 

is  the  coefficient  of  variation  of  the  estimate  x' , 

3.    ESTIMATE  OF  TOTAL  VARIANCE  FROM 
THE  SAMPLE 

Several  methods,  which  require  considerably 
less  work  than  estimating  components  of  variance, 
can  be  used  to  estimate  the  total  variance  of  the 
sample  estimate  from  the  sample  itself.   In  this 
section,  one  such  method  will  be  discussed. 

3.1    Effect  of  systematic  selection 

The  sample  for  Agrostan  is  a  systematic  selec- 
tion of  PSU ' s  at  the  first  stage  and  a  systematic 
selection  of  holdings  within  the  selected  PSU's  at 
the  second  stage. 
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The  systematic  selection  may  improve  the 
sample,  since  it  takes  advantage  of  any  inherent 
stratification  which  results  from  the  order  in 
which  PSU's  are  listed,  and  of  the  order  of  the 
holdings  within  PSU's.   The  order  may  reflect 
geographic  location,  type  of  agricultural  opera- 
tions, climate,  etc. 

Any  formula  for  the  variance  of  a  systematic 
sample  usually  leads  to  an  overestimate  of  the 
variance;  such  an  overestimate  is  conservative,  in 
that  the  true  variance  may  be  expected  to  be 
smaller  than  the  estimated  variance.  Most  statis- 
ticians are  satisfied  with  a  slight  overestimate 
of  the  variance,  since  then  they  may  say  that  the 
estimates  are  probably  more  reliable  than  the 
computed  variances  would  indicate. 

3.2    The  grouped  stratum  method 

One  of  the  methods  for  estimating  variance  is 
the  grouped  stratum  method.   If  it  is  assumed  that 
the  m  PSU's  in  the  Agrostan  sample  were  selected 
in  pairs  from  strata  (2  out  of  every  8  at  random 
instead  of  1  out  of  every  U   systematically),  an 
estimate  of  the  contribution  to  the  total  variance 
from  the  small  holdings  can  be  made  very  easily; 
one  determines  the  difference  between  PSU  sample 
totals  for  small  holdings  in  each  pair,  squares 
the  difference,  weights  by  the  square  of  the  over- 
all weight,  and  sums  over  all  pairs  of  PSU's. 
The  formula  for  this  procedure  is 


(7.10) 


(16); 


Z(W2* 


h=l 


where  x'   and  x'   are  unweighted  sample  totals  for 
small  holdings  in  the  PSU's  in  pair  h,  and  16  is 
the  overall  weight  for  the  sample  holdings. 

To  estimate  the  contribution  to  the  total 
variance  from  the  Local  Certainty  holdings  (which 
were  selected  at  a  rate  of  1  in  4-)  formula  7.10 
would  be  used,  with  the  factor  (l6)2  replaced  by 

U)3. 

An  alternative  estimate  of  the  total  variance 
for  both  sampled  small  holdings  and  Local  Certainty 
holdings  can  be  obtained  by  multiplying  the  PSU 


total  for  each  group  by  its  proper  weight,  adding 
the  two,  and  using  the  formula 


(7.11) 


ss.  =£  <srV 


h=l 


where 

x'   =  4x,   T  (Local  Certainty)  +  l6x    (small 
<\  hx ,  h  hj ,  S 

holdings).  Equation  (7.1l)  may  produce  a  different 
estimate3  from  Equation  (7.10).  Each  equation — 
(7.11)  alone,  or  (7.10)  with  weights  16  and  U   and 
the  results  added — will  produce  an  unbiased  esti- 
mate of  the  total  variance;  that  is,  the  expected 
values  of  the  two  estimates  (over  all  possible 
samples)  is  the  same. 

Calculation  of  variance  estimates  using  this 
method  is  considerably  simpler  than  most  other 
methods  since  estimates  of  PSU  totals  and  the 
squares  of  the  sums  of  totals  are  not  required. 
The  amount  of  work  required  is  minimal,  making  the 
method  suitable  for  censuses  and  surveys  where 
variance  estimates  are  wanted  for  a  large  number 
of  characteristics.  Moreover,  the  method  is  suit- 
able to  hand  computation,  which  may  be  desirable 
if  not  many  variances  are  to  be  computed. 

If  the  order  in  which  PSU's  are  listed  brings 
similar  PSU's  together,  the  assumption  that  the 
PSU's  have  been  stratified  is  good.   The  method 
described  above  will  give  a  closer  estimate  of  the 
true  variance  than  the  assumption  (for  example) 
that  the  PSU ' s  were  selected  by  simple  random 
sampling  from  the  whole  list  of  PSU's. 

If  the  sample  units  are  selected  with  unequal 
probabilities,  PSU  sample  totals  would  have  to  be 
weighted  before  the  differences  are  taken  and  then 
Equation  (7.11)  would  be  used.   With  a  self- 
weighting  sample  (which  may  be  the  case  for  some 
items,  such  as  the  number  of  small  holdings)  the 
weighting  can  be  done  in  a  single  operation  using 
Equation  (7.10).   For  an  estimate  of  the  number  of 
Local  Certainty  holdings,  Equation  (7.10)  would  be 
used  with  a  factor  of  (U)z ,   rather  than  (l6)2. 


Equation  (?.ll)  introduces  a  cross  product  term  between 

4x     and  l6x ,   _  in  each  stratum;  its  expected  value  over 
h.3.,1       ^1,5  *^ 


all  strata  (over  all  samples)  is  zero. 
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3.3   Estimates  of  variance  for  districts 

For  most  districts  the  number  of  PSU's  in  the 
sample  is  small  (less  than  20) ,  and  the  pairing  of 
PSU's  (as  if  selected  from  strata)  will  have  very- 
few  degrees  of  freedom,  so  that  the  estimate  of 
variance  will  not  be  very  reliable.   A  better 
estimate  of  the  variance  of  a  total  will  be  ob- 
tained by  grouping  several  districts  together, 
computing  the  estimate  for  the  group  of  districts, 
and  adjusting  the  estimate  to  take  into  account 
the  number  of  PSU's  in  each  individual  district. 

Example  1:  Bolivar  Province,  which  had  221 
rural  EA's  and  42  urban  EA's  in  1970,  will  have  62 
PSU's  in  the  sample.  Variances  can  be  computed 
using  31  "strata",  each  represented  by  2  PSU's 
which  are  paired  in  the  order  of  selection. 

Estimate  for  a  single  district;  Consider  the 

single  district  Monaco,  which  had  47  EA's  in  1970. 

Suppose  12  of  these  have  been  selected  as  PSU's 

for  the  sample.   A  variance  for  this  district  is 

estimated  by  multiplying  the  estimated  variance 

ft? 
for  62  PSU's  in  Bolivar  Province  by—. 

Example  2;  Raj pur  Province  had  1057  rural 
EA's  and  175  urban  EA's  in  1970  in  13  districts 
(or  combinations  of  districts).   The  EA's  were 
grouped  into  1048  PSU's,  and  264  were  selected  for 
the  sample . 

It  is  not  necessary  to  use  all  264  PSU's  in 
the  variance  computation.   A  sample  of  30  to  50 
can  be  selected  systematically;  each  of  these  is 
then  paired  with  the  next  PSU  selected  in  the 
Province,  and  the  60  to  100  PSU's  can  be  used  to 
compute  the  variance. 

Estimate  for  whole  Province:  If  50  pairs  of 
PSU's  were  used  in  the  variance  computation,  an 
estimate  of  the  variance  for  the  whole  of  Raj  pur 
Province  can  be  made  by  multiplying  the  estimate 
just  found  by  n j"  to  take  account  of  the  fact  that 
there  were  actually  264  PSU ' s  in  the  sample  in  the 
Province  (not  100) . 

However,  since  Raj pur  is  a  large  Province, 
with  much  variability  in  climate  and  type  of 
agricultural  operations,  it  would  be  better  to 
group  the  13  districts  into  2  or  3  sub-areas, 


which  are  relatively  similar,  and  to  compute  an 
estimate  of  the  variance  for  each  sub-area,  using 
about  4.0  pairs  of  sample  PSU's  in  each  sub  area. 

3.4  Estimates  of  variances  for  large 

Provinces 

The  variance  for  a  large  Province  is  the  sum 
of  the  variance  for  the  sub-areas  within  it.   The 
variance  for  a  single  district  is  obtained  in  the 
same  way  as  in  Example  1,  using  the  variance  for 
the  sub-area  in  which  the  district  lies  as  the 
basis  for  the  computation. 

For  example,  if  Ranchi  district  (which  has 
21  PSU's  in  the  sample)  is  in  a  sub-area  having 
85  sample  PSU's,  the  variance  for  the  sub-area 
could  be  computed  using  80  PSU ' s  and  then  multi- 

plying  by  -r~.  For  Ranchi  district,  itself,  the 

80 
estimate  of  the  variance  would  be -r—  times  the 

variance,  computed  from  the  80  PSU's. 

3.5  Estimates  of  variances  for  the  nation 

The  sample  for  the  whole  country  of  Agrostan 
contains  about  3,000  PSU's.  A  quick  estimate  of 
the  variance  of  a  total,  which  may  be  sufficiently 
reliable  for  crops  which  are  grown  in  all  areas  of 
the  country  (such  as  maize  or  beans),  can  be  made 
by  selecting  a  systematic  sample  of  100  pairs  of 
PSU ' s  and  computing  a  variance  by  the  paired-PSU 
procedure.  Of  course  the  estimate  obtained  must 
be  multiplied  by  a  factor  (in  this  case  '   Q  =  15) 
to  get  an  estimate  of  the  variance  for  the  whole 
country. 

The  reason  that  such  short  cut  methods  can  be 
used  is  that  the  variance  of  a  major  crop  is  a 
fairly  stable  characteristic.   An  estimate  of  the 
variance  using  200  PSU's  will  be  quite  reliable, 
and  not  differ  too  much  from  the  estimate  obtained 
using  all  3,000  PSU's.  Moreover,  sampling  errors 
ar«  usually  stated  in  percentages,  with  about  2 
significant  figures;  an  error  of  10$  in  the  esti- 
mate of  the  sampling  error  is  not  usually  con- 
sidered serious. 

A  better  method  of  obtaining  national  vari- 
ance estimates  is  to  add  the  corresponding 
variances  for  all  the  Provinces.  Since  variances 
are  required  for  each  Province,  this  is  a  simple 
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way  to  obtain  the  variance  for  the  whole  country. 
It  has  the  added  advantage  of  giving  more  reliable 
estimates  for  crops  (such  as  cotton)  which  are 
grown  only  in  a  few  Provinces. 

4.    ESTIMATE  OF  COMPONENTS  OF  VARIANCE 
FROM  SAMPLE 

After  the  survey  estimates  have  been  obtained, 
the  sampling  procedures  should  be  examined  with  a 
view  to  designing  more  efficient  samples  for  cur- 
rent agricultural  surveys  and  for  the  next  agri- 
culture census.  This  usually  requires  an  examina- 
tion of  the  components  of  the  variance  for  a  few 
of  the  most  important  characteristics  needed  for 
planning  surveys;  this  task  can  be  carried  out 
after  the  census  operation  is  completed.  Even 
though  these  computations  can  be  delayed,  it  is 
important  that  they  be  carried  out. 

4.1   Between-PSU  variance 

The  total  variance  is  the  sum  of  the  within- 
PSU  and  the  between-PSU  variances;   therefore  an 
estimate  of  one  of  the  components  can  be  obtained 
by  computing  the  other  and  subtracting  from  the 
estimate  of  total.     Mathematically,   this  can  be 
expressed  in  terms  of  the    population  variances  as: 


(7.12) 


o?(X)  =o2(X)   -  aJ(X)   . 


The  total  variance  can  be  estimated  directly 
from  the  sample,  by  the  method  described  above. 
Methods  of  estimating  the  within-PSU  population 
variance  are  given  in  the  next  section. 

4.2    Within-PSU  variance 

The  formula  for  estimating  the  within-PSU 
variance  is 

(7.13:      b3(X.)=^£  Ni"^-  W 

* —        i  i 

i=l 


M2^2  N  -  n 


Nn 


y  -^-s2(x.) 

1=1 


where 


(7.U)     s2(X.)  =— ^  Vfx-.~  *■) 

3=1 


is  an  estimate  of  the  population  variance  ^(X.) 
based  on  the  sample  holdings  within  the  PSU.  A  sim- 
plified formula  for  computation  is 


s2(X.)  = 

w   1 


(n.-  1) 

i 


I> 


4.3  Remarks 


The  components  of  variances  can  be  calculated 
for  the  nation  as  a  whole,  for  Provinces,  or  for 
individual  districts,  depending  on  the  level  at 
which  estimates  will  be  made  for  the  survey  or 
census  that  is  being  planned.  For  a  national 
census  where  estimates  will  be  made  at  all  levels, 
the  components  of  variances  for  a  few  districts 
will  be  needed  to  guide  the  statistician  in  de- 
signing the  sample  for  future  surveys.  For  cur- 
rent surveys  designed  to  produce  national  estimates 
only,  the  components  of  variance  for  a  national 
sample  will  be  needed.  Often  the  statistician  can 
use  estimates  of  components  for  a  few  districts  or 
Provinces  as  being  indicative  of  the  components  for 
the  whole  nation.   In  any  event,  the  PSU's  can  be 
sub-sampled  when  making  estimates  of  variance  for 
the  whole  country,  in  order  to  reduce  the  amount 
of  work  to  a  manageable  level.  Highly  reliable 
estimates  of  components  of  variances  are  usually 
not  required,  since  components  of  variance  will  be 
used  primarily  in  trying  to  optimize  future  sample 
designs  and  since  several  different  designs  may  be 
close  to  the  optimum. 

4.4    Using  components  of  variance 

Components  of  variance  may  be  used  in  several 
ways  in  optimizing  a  sample  design,  depending  on 
the  general  form  of  the  sample  design.  For  a 
simple  design  such  as  the  one  for  the  Agrostan 
Census  of  Agriculture  the  components  can  be  used 
together  with  cost  data  (or  assumptions  of  relative 
costs)  to  find  approximately  the  optimum  number  of 
clusters  (m)  and  holdings  within  the  clusters  (n) 
for  a  given  precision.  Usually  the  optimum  is 
quite  broad,  so  that  fairly  large  changes  in  the 
number  of  PSU's  selected,  and  in  the  number  of 
holdings  in  the  sample  PSU's,  may  not  greatly 
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affect  the  reliability  of  the  estimates.  One  of 
the  responsibilities  of  the  sampling  statistician 
is  to  know  when  the  range  around  the  optimum  is 
rather  broad. 

5.    VARIANCE  OF  THE  ESTIMATE  OF  A  RATIO 
AND  OF  A  RATIO  ESTIMATOR 

In  an  earlier  section  it  was  mentioned  that 
variances  for  estimates  of  ratios  such  as  per- 
centages, averages,  and  proportions,  and  ratio 

estimates  of  totals  or  aggregates  require  a  sepa- 

x' 

rate  discussion.  The  estimator  r  = — :  (of  the 

X  y 

ratio  R  = — )  and  the  estimator  X"  =  rY  (of  the 

level  of  the  characteristic  X)  are  related,  since 
x"  is  Y  times  r.   In  the  estimator  of  aggregates, 
Y  is  a  known  total  from  an  independent  source 
(for  example,  the  1970  Census  population)  and  does 
not  depend  on  the  sample;  hence  it  makes  no  contri- 
bution to  the  sampling  error.   In  fact,  the  rela- 
tive errors  of  r  and  x"  are  identical. 

5,1    Population  variance  of  a  ratio  estimator 

The  relative  variance  of  the  ratio  (r)  of  two 
random  variables  can  be  approximated  very  closely, 
if  the  sample  is  moderately  large.  The  formula  is 
as  follows : 


(7.15) 

V2  =  V2  +  V2,  -  2V  ,  , 
r          x'           y  ■              x'  ,y' 

where 

v2    =fi!,      vj,  =fi 

V      IF           y        Y2 

and 


:',y' 


'X,Y   XY 


Here  p    is  the  correlation  coefficient  between 
X  and  Y  for  the  sample  design  being  used. 

It  can  be  shown  easily  that 


(7.16) 


Vs. 


V2  • 

r 


The  corresponding  population  variances  are: 
(7.17)   <4=R?V?  ±R2    OP,    +  V2,    -  2V   ,      ,) 


>r 


:-fe  (<£,  +RVV,  -2RaYl     .) 


*',y 


and 

(7.18)  <£■  =  *3n  £  Y2o-; 


<£.  +  R24- 


'x',7' 


5.2   Sample  estimate  of  the  variance  of  a 
ratio  estimator 

Sample  numbers  may  be  used  to  make  estimates 
of  the  variances  in  Equations  (7.15)  to  (7.18). 
The  estimates  are  respectively: 


(7.19)      s|, 

v2  =  — 

r   (x<)2 


11 


(y'): 


2(s  ,   .) 

x' ,y' ' 

x'y' 


(y'): 


(s2  +  i^s2  ,-  2rs  , 

x'         yl      x' 


.,) 


s5..=  sf 


+  i^s2  -  2rs 

y1 


'x»,y» 


These  estimates  all  involve  ratios  of  random 
variables,  and  consequently  are  subject  to  the 
biases  inherent  in  ratio  estimates.4   If  the  sam- 
ple sizes  are  relatively  large,  the  biases  will  be 
small,  and  can  be  ignored.   In  practice,  the  esti- 
mates of  variances  using  Equation  (7.19)  may  be 
considered  to  be  approximately  unbiased,  if  the 
sample  is  fairly  large,  and  the  population  is  not 
highly  skewed.  For  Agrostan,  we  have  removed  most 
of  the  skewness  of  the  population  by  the  use  of 
the  National  and  Local  Certainty  lists. 

5.3  Estimate  of  a  ratio  or  proportion 

The  most  important  ratio  estimates  in  the 
Agrostan  Census  will  be  proportions  obtained  by 
taking  the  ratio  of  two  characteristics;  for 
example,  holdings  on  which  cropland  is  irrigated 
as  a  proportion  of  holdings  with  land  in  crops. 
The  estimate  of  such  a  ratio  is  obtained  by  dividing 
the  estimated  totals  for  the  two  characteristics. 
Suppose  the  ratio  of  these  two  characteristics  in 
a  certain  district  is  r  =  ',-.  =  60.3$. 

An  approximate  estimate  of  the  rel-variance  of 
the  ratio  may  be  obtained  by  applying  the  formula 

(7.20)  V2  =  V2   -  V2  • 


4Ihe  estimates  in  Equation  (7.19)  axe  "consistent" 
estimates;  that  is,  the  bias  decreases  and  approaches 
zero  as  the  size  of  the  sample  "becomes  very  large. 
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To  derive  Equation  (7.20),  note  the  relationship 


(7.21) 


Vs  =  V2  +  V2  +  2V 

UW      U      w       uw 


for  the  product  uw,  and  consider  the  identity 
x'  =  J»  -p-  =  y'r. 

Using  (7.21),  the  variance  of  x'  becomes 


(7.22) 


V2,  =  V2,  +  Vs  +  2V  . 
x'     y '     r     y ' ,r 


In  many  situations,  the  correlation  between  y'  and 
r  is  very  small  so  that  V  ,  -  0  and  V2 ,  is  approxi- 
mately V2 ,  +  V2 .  Transposing  terms  gives  Equation 
(7.20). 

6.    ILLUSTRATION  OF  VARIANCE  COMPUTATION 

It  will  be  assumed  that  the  variance  program 
in  Agrostan  will  be  limited  to  about  20  items  con- 
sisting of  (a)  obviously  important  items  (b)  items 
for  which  variance  computations  have  been  requested, 
and  (c)  a  selection  of  items  with  high,  low,  and 
medium  variances.   The  items  might  include: 

(1)  Total  number  of  holdings  by  tenure  classes 

(2)  Land  in  holdings 
Arable  cropland 
Average  size  of  holding 
Number  of  cattle  on  hand 


(3) 
(4) 

(5) 
(6) 


Fertilizer  used  -  amount 

-  proportion  of  holdings 
using  fertilizer 


(7)  Maize  harvested  -  area 

-  yield 

(8)  Groundnuts  harvested  -  area 

-  yield 

(9)  Coffee  harvested  -  area 

-  yield 

(10)  Cash  from  sale  of  agricultural  products 

(11)  Number  of  wheel  tractors  used  on  holding 

(12)  Irrigated  land  -  number  of  holdings 

-  area 

(13)  Hired  workers  for  pay  in  cash 
(14.)  Cash  wages  paid  to  hired  workers 
(15)  Number  of  family  workers 

Exhibit  IV-7-1  illustrates  a  worksheet  which 
can  be  used  in  the  grouped  stratum  method  of  cal- 
culating variances.  For  Bolivar  Province,  the 
62  PSU's  are  used  to  calculate  variances.  The  two 
characteristics  under  consideration  are  (a)  total 
number  of  holdings  and  (b)  number  of  holdings 
using  chemical  or  other  commercial  fertilizer. 

Column  8  of  Form  A-281  gives  the  covariance 
which  is  needed  to  calculate  variances  for  the 
ratio  of  the  number  of  holdings  using  chemical 
fertilizer  to  the  total  number  of  holdings.   The 
totals  on  the  bottom  line  of  columns  2  and  5  give 
the  total  number  of  holdings  and  the  total  using 
fertilizer;  the  totals  for  columns  4  and  7  give 
the  variance  estimates  for  these  items. 
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1.    INTRODUCTION 

By  "sample  design"  we  refer  to  the  total  plan 
for  collecting  information  and  drawing  inferences 
about  a  population.  The  sample  design  thus  in- 
cludes sample  plan,  enumeration  plan,  and  estimat- 
ing methods .  These  three  aspects  of  the  sample 
are  so  closely  related  that  a  change  in  any  one 
usually  requires  changes  in  the  others,  and  de- 
cisions about  all  three  must  be  made  jointly,  es- 
pecially the  first  and  third  aspects . 

In  the  previous  chapters,  two  sample  designs 
were  described .  The  design  for  the  Provinces 
satisfies  the  following  conditions.   It  is  basic- 
ally a  two-stage  sample  of  holdings,  with  modified 
enumeration  areas  (EA's)  of  nearly  equal  popula- 
tion size  as  first-stage  units.  Equal  probability 
sampling  is  used  at  both  stages .  Very  large  hold- 
ings (National  Certainty  holdings)  are  included 
with  certainty  and  medium  size  holdings  (Local  Cer- 
tainty holdings)  are  sampled  at  an  intermediate 
rate .   It  is  assumed  that  holders  can  give  usable 
information  on  all  items  in  an  interview.  The 
principal  method  recommended  for  estimating  popu- 
lation totals  is  a  simple  expansion  by  the  re- 
ciprocal of  the  sampling  rate  for  each  of  the 
three  categories  of  holdings . 

The  design  for  the  Northern  Territory  satis- 
fies the  same  conditions  with  several  exceptions. 
The  first-stage  units  are  not  equal  in  size  and 
they  are  selected  with  unequal  probabilities; 
there  are  no  Local  Certainty  holdings.  Also,  since 
it  is  assumed  that  holders  generally  cannot  give 
usable  information  on  land  area,  a  subs ample  of 
holdings  is  measured  objectively.  Estimates  are 
made  by  simple  expansion  as  in  the  Provinces, 
since  the  overall  probability  of  selection  is  con- 
stant. 

The  possible  choices  among  alternative  sample 
designs  in  a  country  depends  greatly  on  the  types 


of  administrative  divisions,  the  number  of  units 
of  each  type,  their  relation  to  the  publication 
level  required,  and  the  personnel  available  to 
take  the  Census . 

The  following  tabulation  shows  the  types  and 
numbers  of  administrative  divisions  in  Agrostan. 
The  last  column  shows  the  relation  of  each  type 
to  the  sample  design  by  identifying  those  which 
are  primary  sampling  units  (PSU's)  and  those  for 
which  estimates  are  required  in  Agrostan. 


Administrative 
division 


Number 


Relation  to 
sample  design 


Provinces,  total....  1  Publication  level 

Province 13  Publication  level 

District  (modified) 150  Publication  level 

Subdistrict 1, 093 

EA  (modified) 14,936  PSU's 

Northern  Territory..  1  Publication  level 

Zone 3  Publication  level 

Territorial  district....  18 

Chief doms  and  EA's 1,986  PSU's 


The  smallest  administrative  division,  the 
chiefdom  or  EA,  was  chosen  as  the  PSU  in  both 
parts  of  the  country  (modified  EA's  in  the  Prov- 
inces). Estimates  of  specified  accuracy  are  re- 
quired for  the  Provinces  as  a  whole,  and  for  each 
Province  and  each  district  making  up  the  Provinces, 
but  only  the  total  and  each  of  the  three  zones 
making  up  the  Northern  Territory  require  publica- 
tion. 

Other  countries  may  have  different  census 
objectives,  different  size  (area,  population,  and 
amount  of  agriculture),  different  types  of  admin- 
istrative divisions,  different  resources,  and 
other  variations  from  the  conditions  assumed  for 
Agrostan.   Possible  ways  of  changing  the  Agrostan 
design  to  fit  various  conditions  are  discussed  in 
this  chapter. 
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2.    MODIFICATIONS  RELATED  TO  CENSUS 
OBJECTIVES 

Most  countries  will  want  to  publish  census 
results  for  the  smallest  administrative  division 
feasible.  What  is  feasible  will  depend  on  the  in- 
tended uses  of  the  data  and  the  resources  avail- 
able. Furthermore,  the  objectives  may  vary  for 
different  items  of  information. 

2.1    Smaller  publication  areas  or  greater 
accuracy 

In  Agrostan,  the  smallest  area  requiring 
estimates  is  the  district,  having  a  minimum  of 
about  2,500  holdings.  There  are  still  two  small- 
er areas --the  subdistrict  and  the  EA.  The  EA 
was  chosen  as  the  PSU.   If  estimates  were  required 
for  areas  smaller  than  districts,  such  as  subdis- 
tricts  (or  if  estimates  for  districts  were  re- 
quired to  have  greater  accuracy),  it  would  be 
necessary  to  increase  the  sample  size  or  to  change 
the  distribution  of  the  sample.  We  could  take  one 
of  the  following  steps,  which  are  listed  in  order 
from  the  simplest  and  least  expensive  (and  also 
least  effective)  to  the  most  drastic  and  expensive 
(but  most  effective).  We  could — 

(1)  Increase  the  number  of  sample  holdings  per 
PSU. 

(2)  Increase  the  number  of  sample  PSU's. 

(3)  Increase  both  the  number  of  holdings  and  the 
number  of  PSU's  in  the  sample. 

(4)  Use  different  sampling  fractions  in  districts 
(or  subdistricts )  of  different  sizes. 

(5)  Omit  the  PSU  stage  and  select  as  many  hold- 
ings as  required  throughout  the  district, 
either  singly  or  in  clusters  of  neighboring 
noldings  from  the  list  of  all  holdings . 

We  would  then  choose  the  least  costly  step 
that  would  be  expected  to  produce  the  desired  re- 
sults .  Step  1  alone  would  probably  not  be  suffi- 
cient; there  would  be  too  few  PSU's  per  subdis- 
trict (since  these  are  smaller  than  districts) 
either  to  provide  good  subdistrict  estimates  or 
to  substantially  improve  the  district  estimates . 
The  district  sample  in  Agrostan  includes  one- 
fourth  of  the  PSU's.  Even  if  we  increased  this 
number  by  taking  one -half  of  the  PSU's  (step  2), 


there  would  still  be  too  few  sample  PSU's  in  most 
subdistricts  for  acceptable  subdistrict  estimates; 
and  only  a  slight  improvement  in  the  accuracy  of 
district  estimates  would  result.  Step  3,  combining 
steps  1  and  2,  would  have  a  greater  effect  but 
probably  not  enough  to  produce  subdistrict  estimates 
of  the  standard  of  accuracy  set  for  districts  in 
Agrostan.   (Step  4  is  a  variation  of  step  3.)   It 
would  probably  be  necessary  to  take  step  5,  and 
select  a  single-stage  sample  of  holdings. 

In  another  country,  where  the  frame  contains 

a  much  larger  number  of  PSU's  and  the  sample  is 

selected  at  a  lower  rate  than  one-fourth,  it  would 

be  possible  to  obtain  more  improvement  within  the 

two-stage  plan.   In  considering  other  countries, 

it  is  important  to  note  that  the  sampling  rates 

at  each  stage,  as  well  as  the  overall  rate  (=■  x  -  = 
n  4   4 

j£  ) ,  depend  upon  the  number  of  administrative 
divisions  at  different  levels.  The  fact  that  first- 
and  second-stage  rates  are  equal  for  Agrostan  is 
accidental.  The  resulting  number  of  sample  hold- 
ings, which  is  20  X  20  =  400  (plus  certainty  hold- 
ings), may  be  a  suitable  number  for  other  countries 
and  other  levels  of  publication.  One  can  expect  to 
obtain  estimates  of  about  the  same  accuracy  as 
Agrostan  district  estimates  for  a  publication  area 
with  a  sample  of  400  holdings  located  at  20  spots 
(PSU's)  throughout  the  area,  and  supplemented  by 
50  to  100  more  heavily  sampled  large  holdings  in 
the  area.  In  a  country  larger  than  Agrostan,  the 
result  20  x  20  =  400  might  be  obtained  by  f irst- 
and  second -stage  rates  of  1  in  10  and  1  in  4,  with 
an  overall  rate  of  1  in  40,  or  by  another  combina- 
tion leading  to  an  overall  rate  less  than  that 
used  in  Agrostan. 

2.2  Larger  publication  areas  or  less  accuracy 

If  results  are  needed  only  for  larger  areas, 
one  would  still  need  about  400  holdings  in  a  mini- 
mum of  20  PSU's  (supplemented  by  large  holdings) 
in  each  such  area.  One  would  probably  still  use 
a  two-stage  sample  and  would  choose  as  PSU's  (or 
as  the  basis  for  forming  PSU's)  a  unit  which  is 
sufficiently  numerous  and  sufficiently  large  for 
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the  purpose .  There  should  be  at  least  60  of  these 
units  in  the  universe  and  the  units  should  contain, 
on  the  average,  60  or  more  holdings.  Then  it 
would  be  feasible  to  select  a  sample  of  about  20 
PSU's  and  20  holdings  per  PSU  to  represent  the 
area .  The  sample  would  still  be  a  two-stage  sam- 
ple, but  the  PSU  might  be  a  somewhat  larger  unit 
than  the  EA's  in  Agrostan.   If  there  are  several 
types  of  administrative  divisions  in  the  country 
satisfying  these  conditions,  one  should  select 
as  the  PSU  the  smallest  unit  that  is  administra- 
tively feasible,  which  might  still  be  the  Popu- 
lation Census  EA.  The  4-00  holdings  needed  for  the 
larger  area  would  be  obtained  from  a  lower  over- 
all sampling  rate,  which  might  result  from  lower 
rates  at  either  or  both  stages  compared  with 
Agrostan.  However,  the  number  of  first-stage 
units  should  be  at  least  20,  if  possible . 

If,  on  the  other  hand,  we  could  accept  dis- 
trict estimates  of  lower  accuracy  than  for  Agro- 
stan, the  sampling  rates  would  be  reduced  by  the 
converse  of  the  first  three  steps  described  above — 
that  is,  by  selecting  fewer  holdings  per  PSU,  fewer 
PSU's  per  district,  or  both.  The  converse  of 
step  5  would  be  to  increase  the  number  of  stages 
of  sampling,  but  this  might  not  be  feasible .   In 
general,  with  a  given  sample  plan,  changes  in  the 
required  accuracy  level  result  in  changes  in  the 
required  size  of  sample;  a  smaller  sample  is 
sufficient  for  less  accurate  estimates,  and  con- 
versely. This  results  from  the  fact  that  sampling 
error  is  related  inversely  to  Vn,  and  this  rela- 
tion is  roughly  true  even  for  complex  designs . 

2.3  Different  requirements  for  different  items 

There  may  be  many  items  in  the  questionnaire 
which  can  be  estimated  from  a  smaller  sample  than 
the  general  level  required  for  other  items,  for 
one  of  the  following  reasons : 

(1)  They  are  less  variable  than  the  basic  items 
for  which  the  sample  was  designed. 

(2)  They  are  to  be  estimated  only  for  larger 
areas  than  the  basic  items . 

(3)  The  level  of  accuracy  required  is  lower  than 
for  the  basic  items . 


(4-)  They  are  attributes  (for  example,  questions 
answered  by  "yes"  or  "no"). 

It  may  be  sufficient  to  obtain  these  items 
from  a  subsample  of  the  original  sample .  The 
subsample  is  usually  selected  by  a  simple  pro- 
cedure, such  as  systematic  sampling.   Information 
is  obtained  at  the  same  time  as  for  the  ori- 
ginal sample  of  holdings.  The  questions  may  be 
part  of  a  longer  version  of  the  regular  question- 
naire or  they  may  appear  on  a  separate  question- 
naire, which  is  filled  in  addition  to  the  regular 
questionnaire  for  the  subsample  of  holdings .  Since 
there  would  be  no  saving  in  travel  costs  from 
further  clustering,  the  sample  should  be  selected 
in  all  original  sample  PSU's.  The  subsample  may 
be  selected  by  a  method  similar  to  that  used  to 
select  the  subsample  for  area  measurements  in  the 
Northern  Territory.  Where  it  is  feasible  to  divide 
the  items  into  two  categories  requiring  signifi- 
cantly different  sample  sizes,  this  procedure  will 
result  in  a  substantial  saving  in  interviewing  and 
processing  time  and  costs . 

3.    MODIFICATIONS  RELATED  TO  SIZE  OF 
COUNTRY 

For  many  types  of  surveys,  the  complexity  of 
the  sample  design  will  increase  with  the  size  of 
the  country.  This  is  especially  true  when  only 
national  or  large-region  estimates  are  required. 
However,  it  is  usually  the  objective  of  a  census 
(whether  sample  or  complete)  to  produce  results 
for  rather  small  areas,  and  the  sample  must  be 
designed  to  provide  estimates  for  these  small 
areas .  The  size  of  these  areas  is  frequently 
about  the  same  from  country  to  country.  The  sam- 
ple design  for  these  small  areas  will  therefore 
be  similar,  whatever  the  size  of  the  country. 
The  enumeration  plan  may  be  affected,  however. 
If  the  number  of  small  areas  to  be  published  is 
much  larger  than  the  number  for  Agrostan,  signifi- 
cant changes  would  be  required  in  the  number  of 
enumerators,  the  training  program,  the  field  or- 
ganization, and  other  aspects  of  the  census. 
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4.    MODIFICATIONS  RELATED  TO  AVAIL- 
ABILITY OF  RESOURCES 

Many  assumptions  were  made  about  the  avail- 
ability of  previous  census  data,  maps,  money, 
personnel,  and  transportation  facilities  in  the 
Agrostan  design.  The  design  in  the  Northern 
Territory  provides  an  example  of  some  variations 
in  these  assumptions .   It  includes  the  case  where 
there  was  no  previous  agriculture  census,  and  maps 
from  the  population  census  were  not  very  complete 
or  accurate.  Many  of  the  assumptions  about  re- 
sources are  reflected  in  the  census  objectives, 
such  as  the  accuracy  required,  and  the  level  of 
publication.  Their  effect  on  sample  design  has 
already  been  discussed  in  section  3.  Some  possible 
assumptions  about  resources  may  have  other  effects, 
which  are  not  entirely  reflected  in  changed  ob- 
jectives; they  will  not  necessarily  lead  to  the 
design  for  the  Northern  Territory.  This  section 
contains  general  comments  on  some  of  the  situations 
already  included  as  examples  in  the  Agrostan  design, 
and  on  other  possibilities  not  yet  discussed. 

4.1    No  previous  agriculture  census 

In  a  country  which,  like  the  Northern  Terri- 
tory of  Agrostan,  had  no  previous  census  of  agri- 
culture, the  sample  will  probably  be  less  efficient 
and  a  larger  sample  will  be  needed  for  given 
levels  of  accuracy.  Stratification  may  be  less 
effective,  and  cutoffs  for  defining  the  larger 
holdings  (which  are  to  be  selected  with  certainty) 
may  be  more  difficult  to  choose .  The  minimum 
criteria  for  defining  a  holding  may  have  to  be 
set  lower  in  order  to  be  certain  not  to  exclude 
too  much  of  the  production.  Much  more  pretesting 
of  questionnaires  and  procedures  will  be  required. 
If  there  has  been  a  previous  population  census 
with  well  identified  EA's,  the  sample  design  may 
be  similar  to  that  used  in  the  Provinces  of  Agro- 
stan. However,  the  samples  should  be  slightly 
larger  in  each  area  requiring  estimates,  in  order 
to  offset  the  reduced  efficiency  of  basing  the 
sample  on  population  counts  which  may  not  be  closely 
related  to  holdings . 


4.2  No  previous  complete  population  census 

For  Agrostan,  it  was  assumed  that  a  complete 
population  census  would  be  taken  shortly  before 
the  Agriculture  Census .  This  Census  would  provide 
a  frame  for  selecting  the  agriculture  sample . 
However,  in  some  countries  the  population  census 
might  be  taken  on  a  sample  basis.   In  that  case, 
the  frame  used  to  select  PSU's  for  the  population 
sample  may  be  usable  to  select  PSU's  for  the  agri- 
culture sample  census .   If  those  units  were  too 
large  or  too  variable  in  terms  of  numbers  of  hold- 
ings for  an  efficient  agriculture  sample,  map 
work  and  field  work  may  be  required  to  modify  them. 
The  field  work  can  be  reduced  by  the  procedure 
applied  in  the  Provinces  of  Agrostan,  using  rough 
measures  of  size,  based  on  the  best  available 
information,  and  selecting  the  sample  before  sub- 
dividing units  which  have  measures  greater  than 
one;  this  procedure  limits  the  work  of  subdividing 
to  only  the  selected  units  (see  chapter  3). 

For  a  given  level  of  publication  and  degree 
of  accuracy  a  larger  sampling  fraction  will  usu- 
ally be  needed  for  a  sample  agriculture  census 
than  for  the  sample  population  census,  since  not 
all  households  are  associated  with  holdings,  and 
agricultural  data  are  often  more  variable  than 
population  data.  Hence  it  would  usually  not  be 
feasible  to  select  the  sample  for  the  agriculture 
census  by  including  only  a  subsample,  or  even  all, 
of  the  population  sample  households  associated  with 
agricultural  holdings . 

4.3  Simultaneous  sample  censuses  of  population 

and  agriculture 

In  some  cases,  it  may  be  advisable  to  take 
a  sample  population  census  and  a  sample  agricul- 
ture census  simultaneously  or  nearly  so.  A  list 
of  all  households  in  the  country  or  a  complete 
list  of  households  within  selected  PSU's,  together 
with  identification  of  households  having  agri- 
cultural holders,  can  provide  the  frames  for 
both  samples.  With  a  Listing  Book  such  as  that 
used  in  the  Provinces  of  Agrostan,  for  example, 
column  3  entries  identify  households,  and  column 
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7  entries  identify  holdings .  The  population 
sample  might  include  1  in  10  of  the  households  in 
column  3,  and  the  agriculture  sample  might  include 
1  in  4-  of  the  entries  in  column  7.  Then  the  prob- 
ability that  any  household  would  be  in  both  samples 
would  be  only  about  1  in  40.  The  person  making  the 
list  might  also  fill  the  population  census  ques- 
tionnaire. Another  enumerator  could  conduct  the 
interview  for  the  agriculture  questionnaire  on  a 
separate  visit.  This  will  reduce  the  training 
burden  on  the  interviewers  resulting  from  the 
variety  of  concepts  involved,  and  it  will  reduce 
the  burden  placed  on  respondents  at  any  one  time; 
it  would  also  assure  that  very  few  respondents 
would  be  in  both  samples .   If  it  is  feared  that 
duplication  of  the  same  households  in  both  samples 
will  reduce  cooperation,  unbiased  procedures  can 
be  used  to  eliminate  the  duplicates .  The  sampling 
rate  would  be  increased  to  offset  rejected  dupli- 
cates . 

Units  to  be  included  with  certainty  would 
generally  not  be  the  same  for  the  population  and 
agriculture  samples .   Institutional  units  would 
not  necessarily  be  associated  with  large  holdings 
and  large  holdings  would  usually  not  have  large 
populations .   National  and  Local  Certainty  hold- 
ings would  be  identified  as  in  Agrostan,  and  lists 
of  the  former  would  be  compiled  in  advance.  These 
would  be  checked  and  the  Local  Certainties  would 
be  identified  from  information  obtained  during  the 
listing  operation.  This  operation  would  be  per- 
formed by  the  population  sample  enumerator  if  the 
division  of  labor  suggested  above  is  followed. 
The  data  collection  form  for  the  listing  operation 
would  be  somewhat  longer,  in  order  to  obtain  the 
information  needed  for  differential  sampling  of 
larger  units  for  both  samples . 

4.4  More  detailed  maps  available 

If  maps  are  available  showing  an  indicated 
number  of  holdings  related  to  identifiable  areas 
smaller  than  Agrostan 's  EA's,  they  may  be  used  to 
improve  the  efficiency  of  the  sample,  even  though 
the  indicated  numbers  are  inaccurate  and  out-of- 
date.  These  numbers  should,  however,  be  fairly 


well  correlated  with  actual  numbers  of  holdings . 
Small,  compact  areas  with  identifiable  boundaries 
(hereafter  called  "segments")  can  be  formed  on 
maps  and  used  as  single-stage  or  second-stage 
sampling  units.  Several  variations  are  possible, 
as  discussed  below.  All  these  methods  reduce  the 
amount  of  listing  required,  while  increasing  the 
amount  of  map  work. 

4.4-1  Single-stage  segments  with  single  meas- 
ures of  size. --In  the  simple  case  of  a  single 
stage  sample,  there  is  a  large  amount  of  map  work. 
The  map  of  each  EA  (or  other  small  administrative 
unit)  is  divided  into  segments  containing  approxi- 
mately equal  numbers  of  holdings.1  Definite  limits 
would  be  set  on  the  expected  number  of  holdings  per 
segment — for  example,  3  to  9  (average  6)  or  5  to 
15  (average  10),  depending  on  the  amount  of  detail 
usable  for  segment  boundaries  on  the  maps .  A 
segment  averaging  more  than  8  or  10  holdings  would 
probably  not  be  efficient,  since  holdings  which  are 
located  near  each  other  are  likely  to  be  similar. 
A  geographically  ordered  sample,  probably  strati- 
fied by  urban-rural  residence,  is  selected  sys- 
tematically. 

4.42  Single-stage  segments  with  varying 
measures  of  size. --If  map  detail  is  not  sufficient 
to  make  identifiable  segments,  all  of  which  are 
close  to  the  desired  size,  the  segments  are  allowed 
to  vary  in  size  and  additional  measures  of  size 
are  assigned  to  those  above  the  desired  range . 
For  example,  suppose  it  is  desired  to  have  seg- 
ments of  about  6  holdings  (the  number  6  might  be 
chosen  because  it  is  the  number  of  holdings  an 
enumerator  can  be  expected  to  interview  in  one 
day,  including  time  for  travel  to  the  segment  and 
possibly  for  contacting  an  official) .  Segments 
could  be  formed  containing  an  estimated  6  holdings, 
and  all  segments  in  the  range  of  4  to  8  holdings 
would  have  single  measures  of  size.  A  segment  in 
the  range  of  9  to  15  holdings  would  have  a  double 
measure  of  size.  Segments  with  more  than  15  ex- 


In  practice,  segments  might  "be  set  up  according  to  num- 
"bers  of  housing  units  (HU1  s)  ;  assumptions  can  "be  made  as 
to  the  relationship  of  HU' s  and  holdings. 
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pected  holdings  could  be  subdivided  before  the 
sample  selection  is  made,  or  assigned  larger 
measures  of  size . 

Segments  are  then  selected  systematically 
from  the  cumulated  measures  of  size,  following  the 
procedure  used  for  EA's  in  chapter  3.   If,  for 
example,  a  segment  with  a  double  measure  is  selec- 
ted, there  is  a  choice  of  two  procedures,  each 
resulting  in  a  self -weighted  sample : 

(1)  The  enumerator  or  supervisor  divides  the 
double  segment  into  two  parts  in  the  field, 
using  visible  boundaries.  One  part  is  se- 
lected at  random  for  the  sample. 

(2)  The  entire  segment  is  listed,  and  a  sample 

of  1  in  2  holdings  is  selected  from  the  list. 

-4.43  Segments  as  second-stage  units. — When 
the  number  of  segments  that  would  be  required  for 
specified  accuracy  of  a  publication  area  is  large, 
it  may  be  feasible  to  select  segments  in  two 
stages.  With  two-stage  sampling,  a  somewhat  lar- 
ger number  of  holdings  and  segments  would  be  re- 
quired; two-stage  sampling  would  be  used  only  if 
there  would  be  a  net  saving  resulting  from  the 
lower  travel  costs  for  a  given  accuracy  level. 
The  number  of  sample  PSU's  in  a  district  should 
not  be  smaller  than  about  20. 

When  segments  are  used  as  second-stage  units 
in  this  way,  it  is  often  convenient  to  select 
first-stage  units  with  probability  proportional 
to  size  (based  on  the  expected  number  of  holdings) 
and  to  select  segments  with  probability  inversely 
proportional  to  size .  This  eliminates  the  need 
for  equalizing  the  size  of  PSU's,  and  gives  a 
self -weighted  sample,  with  approximately  equal 
workload  (number  of  segments  and  number  of  hold- 
ings) in  each  selected  PSU.  It  also  reduces  the 
error  due  to  variability  in  PSU  size,  resulting 
from  the  fact  that  it  is  often  not  possible  to 
make  the  PSU's  nearly  equal  in  size  (see  section 
5.5  below).   It  also  eliminates  the  need  for  seg- 
menting (dividing  into  segments)  the  entire  area 
for  which  estimates  are  required.  Segmenting  will 
be  done  only  in  the  selected  PSU's. 

In  the  example  above,  the  segments  may  vary 
in  size  fully  as  much  (relatively)  as  the  PSU's 


in  Agrostan.  The  variability  in  segment  size  does 
not  have  as  serious  an  effect  on  accuracy  of  esti- 
mates, however,  because  the  number  of  sample  seg- 
ments in  an  area  requiring  estimates  is  many  times 
as  large  as  the  sample  number  of  PSU's. 

A-.A4-     Use  of  cadastral  survey  maps. --In  some 
countries,  detailed  maps  showing  land  utilization 
by  small  plots  and  by  ownership  are  available  for 
revenue  purposes.  They  may  have  serious  errors 
or  biases  in  areas  reported,  but  still  may  provide 
supplemental  information  which  can  be  used  to  im- 
prove sampling  or  estimating  methods .  Because 
they  are  on  an  ownership  rather  than  an  operator 
(holding)  basis,  and  because  of  their  identification 
with  the  revenue  system,  it  may  not  be  feasible  to 
use  them  as  a  source  of  individual  holding  infor- 
mation for  sample  selection. 

The  cadastral  registers  or  area  data  which 
are  usually  associated  with  such  maps  may  be 
available  to  provide  information  needed  to  assign 
measures  of  size  to  PSU's,  or  as  supplemental 
information  in  ratio  estimates  (on  the  latter  use, 
see  section  A. 8   below). 

4.5   Less  detailed  maps  available 

The  Northern  Territory  of  Agrostan  provides 
an  example  of  the  situation  in  which  available 
maps  show  too  little  detail  to  be  useful  in 
forming  PSU's,  or  even  in  assigning  census  enu- 
meration work.   Instead,  a  list  of  administrative 
areas,  with  their  preliminary  1960  population 
counts,  was  used  as  a  starting  point  for  preparing 
a  frame  of  PSU's. 

The  procedure  is  to  obtain  as  complete  a  list 
of  areas  as  possible,  with  the  help  of  field  work 
if  necessary,  and  to  obtain  whatever  information 
is  available  on  their  size  (number  of  holdings  or 
some  quantity  correlated  with  the  number  of  hold- 
ings). PSU's  can  then  be  selected  with  unequal 
probabilities,  following  the  method  of  the  Northern 
Territory  or  with  probability  proportional  to  the 
measure  of  size .  By  keeping  the  overall  probability 
equal  for  the  selection  of  holdings,  the  sample 
can  be  made  self -weighting. 
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4.6  More  money  and  staff  available 

The  census  objectives  of  accuracy  and  type 
of  detail  to  "be  published  require  a  certain  amount 
of  personnel  and  budgetary  resources.   Increased 
availability  of  these  resources  would  permit 
carrying  out  some  experiments  in  sample  design 
as  part  of  the  Census ;  the  purpose  would  be  to 
obtain  information  which  will  permit  the  design 
of  more  efficient  future  samples  for  intercensal 
surveys  or  later  censuses,  and  to  study  nonsam- 
pling  errors .  Some  such  research  features  are 
described  in  the  next  section. 

4.7  Use  of  interpenetrating  subsamples 

Sometimes  it  is  useful  to  design  the  sample 
so  that  it  can  be  divided  into  a  number  of  inde- 
pendent subsamples  of  the  same  design- -called 
"interpenetrating"  samples,  which  can  serve  sev- 
eral possible  purposes .  Some  of  these  purposes 
and  the  applicable  methods  are  discussed  below. 
The  first  two  methods  are  applicable  to  the  Agro- 
stan  sample  as  originally  designed. 

4.71  Successive  estimates . --Preliminary 
district  estimates  can  be  made  from  a  systematic 
subsample  of  the  entire  district  sample .  Because 
of  the  importance  of  "certainty"  holdings,  the 
interpenetrating  subsamples  would  be  identified 
only  among  the  small  holdings,  and  all  certainty 
holdings  would  be  included  in  each  successive 
estimate .  Thus ,  for  example ,  each  small  holding 
in  the  sample,  might  be  identified  as  bexonging  to 
one  of  four  subsamples,  each  obtained  by  starting 
with  a  different  random  number  from  1  to  4.  This 
would  be  done  in  every  sample  PSU  after  making 
duplicates  for  noninterviews .   If  we  call  the 
four  subsamples  of  small  holdings  A,  B,  C,  and  D, 
successive  estimates  would  be  based  on  subsamples 
A;  A  and  B;  A,  B,  and  C;  and  A,  B,  C,  and  D;  plus 
(in  every  case)  the  certainty  holdings.  For  the 
first  such  district  estimate,  sample  totals  would 
be  combined  as  follows  to  obtain  estimates  of 
aggregates . 


1st  estimate : 

Enumerated  total  for  National  Certainty 
holdings 

+  4  x  (enumerated  total  for  Local  Certainty 
holdings) 

+  64  x  (enumerated  total  for  sample  small 
holdings  in  A  group)  =  64  S  A  =  A' 

where  the  multiplying  factors  are  reciprocals 
of  the  probabilities  of  selection. 

For  the  second  and  successive  estimates,  the 
estimate  for  certainty  holdings  would  remain 
unchanged,  but  the  estimate  for  small  holdings 
would  be  an  average  of  independent  estimates 
from  the  subsamples . 

2nd  estimate  of  small  holdings : 

(A'  +  B')  -r  2  where  B'  (which  is  obtained  by 
the  same  estimating  formula  as  A')  =  64  SB. 

3rd  estimate  of  small  holdings : 

(A'  +  B'+C')t3  where  C  =  64  EC. 

4th  estimate  of  small  holdings : 

(A'  +  B'  +  C  +  D')  t  4  where  D'  =  64  ED. 

Since  the  number  of  sample  small  holdings  in  a  dis- 
trict often  would  not  be  exactly  divisible  by  4, 
more  exact  expansion  factors  should  be  based  on 
the  exact  number  of  small  holdings  in  the  groups . 
For  example,  if  there  were  387  sample  small  hold- 
ings in  a  district,  the  A,  B,  C,  and  D  groups  would 
vary  by  one  unit  and  would  consist  of  96,  97,  97, 
and  97  instead  of  identical  numbers .  Then  the 
formulas  for  estimating  the  aggregates  A',  B',  etc., 
for  small  holdings  would  be : 


A'  =  (lr*  '■ 


/387 
\97 


x  16 


>)Ea; 


etc  . 


The  difference  between  A'  and  B'  can  provide 
the  basis  for  a  very  rough  estimate  (one  degree  of 
freedom)  of  the  sampling  error  in  the  second  esti- 
mate for  small  holdings : 

s(of  second  estimate  of  small  holdings)  = 
s2  -r  VF, 

where  s  is  the  estimated  population  variance 

of  the  expanded  estimates,  A'  and  B1;  and  n  is 
the  number  of  such  expanded  estimates  which  are 
combined  (namely,  2).  Since  the  second  estimate 
of  small  holdings  is  (A'  +  B')  -f  2, 
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s  = 


[A,.i-pi]'.+  [l 


(A1  +  B1 


n  -  1 


(A'  -  B')"   and 
2 


_  |A'  -  B'| 
2  "    V2 


ation  of  A'  and  B'. 


Thus  s  (of  second  estimate  of  small  holdings)  = 

s3  _  |A'  -  B'| 
Vn"       2 

The  term  |a>  -  B'|  refers  to  the  absolute  value 
of  the  difference  (A1  -  B'). 

Similarly,  the  three  estimates  of  A',  B',  C 
lead  to  an  estimate  of  the  population  variance 


3    2 


(A.  ,A'+  B;+  C'J-h(b'  -A'+  B3'  +  C'J 
+  (C,  -  A'  ±  |!  ±   Oj 

based  on  two  degrees  of  freedom. 

s  (of  third  estimate  for  small  holdings)  = 
/estimated  standard  deviation  of) 


3_ 

Vn~ 


A' ,  B' ,  and  C1 


VT 


s  (of  fourth  estimate  for  small  holdings)  = 

/estimated  standard  deviation  of 
S4  _\ A',  B'  ,  C,  and  D1 

VrT  V~T=  2 

This  procedure  does  not  provide  an  estimate 
of  the  component  of  sampling  error  contributed  by 
the  Local  Certainty  holdings . 

4.72  Evaluation  of  sampling  error. --As  de- 
scribed in  section  3  of  chapter  7,  the  Agrostan 
sample  will  be  divided  into  independent  subsamples 
for  the  purpose  of  estimating  sampling  error,  even 
when  no  successive  estimates  are  made.  That  divi- 
sion provides  error  estimates  for  the  total  esti- 
mate (not  for  small  holdings  only),  and  it  is  based 
on  more  degrees  of  freedom  than  are  available  by 
the  procedure  described  above . 


4.73  Evaluation  of  "enumerator  effect". — The 
usual  and  least  expensive  way  to  distribute  the 
interviewing  work  among  the  enumerators  is  to  assign 
each  enumerator  a  separate  area  such  as  a  primary 
sampling  unit,  or  a  group  of  primary  sampling  units, 
as  in  Agrostan.  However,  it  is  also  possible  to 
assign  the  sample  holdings  within  one  or  more  PSU's 
to  two  enumerators .   If  the  two  groups  of  holdings 
are  selected  so  that  each  is  an  independent  sub- 
sample,  and  these  samples  are  large  enough,  an 
analysis  of  the  results  obtained  by  the  two  enumer- 
ators can  provide  a  means  of  evaluating  the  "enu- 
merator effect".  This  term  (also  called  "inter- 
viewer effect")  refers  to  the  differences  which 
cannot  be  explained  as  sampling  error,  but  must  be 
attributed  to  consistent  tendencies  of  individual 
enumerators  to  obtain  different  results  from  the 
same  population.   If  the  entire  sample  is  thus 
divided  among  pairs  of  enumerators,  it  is  possible 
to  separate  an  enumerator  variance  as  a  component 
of  the  total  variance.  Travel  costs  are,  of  course, 
greater  than  for  the  usual  sample  design. 

In  a  number  of  experiments,  this  method  has 
revealed  surprisingly  large  variability  related  to 
enumerators,  presumably  due  to  their  attitudes, 
preconceptions,  or  to  different  ways  of  asking 
questions  or  recording  answers.  These  studies 
have  led  to  the  conclusion  that,  in  addition  to 
having  at  least  a  moderately  large  number  of  sam- 
ple PSU's  in  a  publication  area,  there  should  be 
a  moderately  large  number  of  enumerators,  so  that 
enumerator  differences  can  balance  out  .  From 
this  viewpoint,  a  large  number  of  part-time  enu- 
merators would  be  better  than  a  smaller  number  of 
full-time  enumerators.  Where  it  is  feasible,  self- 
enumeration  (having  the  respondent  fill  the  ques- 
tionnaire himself)  with  follow-up  of  nonrespondents 
may  give  better  results  than  interviews . 

It  is  theoretically  possible  to  adapt  a  sample 
design  such  as  the  Agrostan  design  for  the  purpose 
of  studying  the  enumerator  effect  merely  by  as- 
signing enumerators  so  that  two  or  more  have  inde- 
pendent subsamples  within  the  same  territory.  Us- 
ually it  is  more  efficient,  however,  to  design  the 
sample  somewhat  differently,  because  of  the  in- 
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creased  costs  from  having  two  enumerators  working 
in  the  same  area.  Larger  units  might  be  needed 
for  PSU's  or  PSU's  might  be  chosen  in  contiguous 
pairs  to  minimize  the  additional  travel  inherent 
in  this  type  of  interpenetrating  sample . 

4.8   Use  of  first-stage  ratio  estimates 

Reference  was  made  in  section  7.3  of  chapter 
6  to  the  possibility  of  using  a  ratio  estimating 
method  to  improve  estimates.  The  method  is  equi- 
valent to  multiplying  the  original  estimate  (x'), 
obtained  from  a  simple  expansion  of  the  sample 
data,  by  a  factor  which  will  adjust  for  the  fact 
that  a  particular  sample  which  was  selected  will 
be  "higher"  (or  "lower")  than  the  whole  population. 
The  factor  is  a  known  total  for  the  whole  popula- 
tion (Y)  divided  by  the  estimate  (y1)  of  the  same 
known  total,  which  is  made  by  using  this  particular 
sample.  Both  Y  and  y'  are  based  on  data  obtained 
from  a  source  which  is  independent  of  the  current 
survey  (for  example,  from  a  previous  census  or 
cadastral  records).  In  the  method  suggested  in 
chapter  6,  the  known  information  (not  used  in  the 
simple  estimating  method)  was  the  1970  population 
for  the  sample  PSU's  and  for  all  FSU's.  This  pro- 
cedure is  called  a  "first-stage  ratio  adjustment," 
since  it  is  based  on  data  for  PSU's  only  (not 
individual  holdings ) .  The  resulting  estimate  is 


""'&) 


Where  there  are  different  weight  groups  and 
National  Certainty  (NC)  holdings  are  selected  from 
all  PSU's,  care  must  be  taken  in  applying  the 
ratio  adjustment.   National  Certainty  holdings  must 
be  treated  as  a  separate  universe  not  requiring 
adjustment.  If  they  comprise  a  substantial  part 
of  the  total  to  be  estimated,  it  may  be  necessary 
to  subtract  the  value  of  the  supplementary  vari- 
able for  NC  holdings  from  the  numerator  and  denomi- 
nator of  the  adjustment  factor.  This  may  not  be 
necessary  if  the  supplementary  variable  is  popu- 
lation or  number  of  holdings,  but  would  be  desir- 
able if  it  is  an  aggregate  such  as  land  in  hold- 
ings. For  example,  in  making  a  district  estimate 


of  an  aggregate  for  a  variable  X  in  one  of  the 
Provinces  of  Agrostan  by  this  method,  using  land 
in  holdings  as  the  supplementary  variable,  the 
following  steps  would  be  involved : 

(1)  Adjust  each  weight  group  for  noninterviews 
(small  holdings,  Local  Certainty  holdings, 
and  National  Certainty  holdings ) . 

(2)  Obtain  corresponding  sample  totals  of  X 
adjusted  for  noninterviews . 

(3)  Multiply  each  sample  total  by  its  appropriate 
weight  (16;  4;  1)  to  obtain  estimated  aggre- 
gates (xT )  . 

(4)  Add  the  estimates  (xT)  for  the  small  holdings 
and  Local  Certainty  holdings  to  obtain  the 
universe  estimate  excluding  National  Cer- 
tainty holdings . 

(5)  Compute  the  first-stage  ratio  factor  based 
on  land  in  holdings : 

Total  land  in  holdings  in  the  district 

excluding  land  in  NC  holdings Y 

(Land  in  holdings  in  sample  PSU's  exclud-  ~  y1 
ing  land  in  NC  holdings2)  x  4. 

(6)  Multiply  the  sample  estimated  total  exclud- 
ing National  Certainty  holdings  obtained  in 
step  (4)  by  the  factor  obtained  in  step  (5). 

(7)  Add  the  total3  for  X  for  National  Certainty 
holdings  in  step  (2)  to  the  adjusted  total 
obtained  in  step  (6). 

An  example  of  another  supplementary  variable 
which  might  be  used  for  the  first-stage  ratio  esti- 
mate is  the  number  of  holdings  at  a  previous  cen- 
sus date . 

Before  deciding  whether  to  use  a  ratio  esti- 
mating method,  some  rough  analysis  should  be  made 
by  the  sampling  statistician,  considering  the  ex- 
pected correlation  between  important  items  being 
estimated  and  the  supplementary  variable,  as  well 
as  expected  variances  of  each.  The  correlation 
must  be  reasonably  large  (for  example,  +.7  or  more), 
and  the  relative  variance  of  the  supplementary 
variable  must  not  be  much  larger  than  that  of  the 
items  estimated  in  order  to  obtain  enough  improve- 
ment to  justify  the  additional  work. 


That   is,    land   in  National  Certainty  holdings  associated 
with  the   sample  PSU1  s  only. 

This  is  a  total  rather  than  a  sample  estimate,    since 
all  National  Certainty  holdings  are  in  the   sample.      The 
adjustment   for   (possible)    non-responses  should  "be  so  small 
that  this  can  still  "be  considered  a  "total",    rather  than 
an  estimate. 
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5.    OTHER  CONSIDERATIONS  AFFECTING 
SAMPLE  DESIGN 

In  addition  to  census  objectives  and  avail- 
able resources,  a  country  may  differ  in  many  ways 
from  Agrostan.  The  sample  design  should  reflect 
the  differences . 

5.1  Number  and  definition  of  weight  groups 

In  any  sample  design,  the  number  of  different 
weight  groups  should  be  kept  as  small  as  possible; 
they  should  be  defined  in  a  manner  that  makes  it 
easy  to  classify  the  holdings.  For  example,  it 
would  be  desirable  tc  use  a  single  definition  of 
Local  Certainty  holdings,  if  the  proportion  of 
holdings  in  the  various  size  groups  does  not  differ 
greatly  from  area  to  area  within  the  country. 

Even  if  the  number  of  large  and  medium-size 
holdings  differs  widely  throughout  the  country,  a 
cutoff  point  could  be  used  which  is  lower  than 
necessary  for  some  areas.   This  would  give  a  uni- 
form field  procedure  and  simplify  tabulation  and 
control,  but  would  require  a  somewhat  larger  sam- 
ple . 

5.2  Holdings  operated  by  city  residents 

It  has  been  assumed  throughout  the  Agrostan 
case  study  that  an  agriculture  census  will  obtain 
data  relating  to  holdings — that  is,  on  an  operator 
basis.  The  further  assumption  made  in  Agrostan-- 
that  it  is  feasible  to  sample  holdings  by  way  of 
operators'  households --may  not  always  be  appro- 
priate .  In  the  definition  of  the  universe  for 
Agrostan,  it  was  assumed  that  holdings  of  opera- 
tors living  in  the  built-up  parts  of  the  cities 
(plus  those  of  less  than  yrr- hectare )  could  be  ex- 
cluded with  only  a  slight  loss  of  coverage  of  the 
total  agricultural  production. 

In  a  country  or  part  of  a  country  where  a 
substantial  number  of  holders  live  in  the  city  and 
operate  holdings  outside  the  city,  some  other 
method  must  be  used  to  provide  a  satisfactory  frame 
of  holders  (or  holdings).   It  is  usually  not 
feasible  to  canvass  all  housing  units  in  the  cities, 
in  order  to  find  those  having  a  household  member 


who  operates  a  holding.   It  may  be  feasible  to  ask 
a  question  in  a  population  census  taken  shortly 
before  (or  concurrent  with)  the  agriculture  census, 
which  will  identify  the  households  of  such  absentee 
agricultural  operators,  and  thus  provide  a  frame 
for  sampling  them. 

If  this  procedure  cannot  be  applied,  it  may 
be  very  difficult  to  obtain  complete  coverage  of 
these  holdings,  and  to  assure  that  they  have  a 
chance  of  being  selected  in  the  sample.  A  possi- 
ble procedure  is  to  list  not  only  the  heads  of 
households  living  in  the  sample  enumeration  areas, 
but  also  every  person  not  living  there  who  has 
agricultural  operations  in  the  enumeration  area.4 
By  checking  "yes"  or  "no"  answers  to  various 
questions,  the  enumerator  can  identify  persons 
with  holdings  in  the  enumeration  area  but  living 
outside  the  enumeration  area,  into  one  of  the 
following  classes : 

(1)  Those  who  have  agricultural  operations  where 
they  live . 

(2)  Those  who  do  not  have  agricultural  operations 
where  they  live . 

For  the  latter,  the  enumerator  obtains  the  address, 
and  a  follow-up  is  made  to  obtain  a  questionnaire . 
For  the  former,  no  follow-up  is  needed,  since  it 
is  assumed  that  these  holders  will  be  included  in 
the  census  where  they  live . 

Similar  methods  have  been  used  in  connection 
with  segment  samples,  where  it  may  be  somewhat 
easier  to  account  for  all  agricultural  land,  and 
thus  get  full  coverage  of  the  absentee  operators. 

5.3   Communal  forms  of  tenure 

As  described  in  Unit  III,   there  are  various 
forms  of  tribal,    collective,    and  cooperative 
tenure  under  which  agricultural  land  is  held. 
These  variations  affect  questionnaire  design  and 
methods   of  data  collection  more   than  they  affect 
sample  design.      It  is  assumed  that  the  larger 
communal  holdings  would  be  comparable   to  National 


Same  absentee  holders  may  have  agricultural  operations 
in  several  EA.'s;   to  avoid  duplication,    some  rule  should  he 
established,    such  as  recording  the  holding  only  in  the  EA. 
where  the  largest  portion  is  located. 
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PSU 
(or  EA) 


or  Local  Certainty  holdings  in  Agrostan,  and  that 
there  is  a  "person-in-charge"  who  can  be  treated 
as  the  operator.  Where  separate  pieces  of  land  are 
individually  owned  by  or  assigned  to  members  of  a 
cooperative,  these  individuals  will  be  treated  as 
the  operators .   If  individual  holdings  consist  only 
of  "kitchen  gardens"  for  family  use,  while  the  major 
production  is  from  communally  operated  land,  treat- 
ment would  depend  on  whether  Census  objectives  in- 
clude information  about  such  gardens.   If  so,  the 
questionnaire  for  the  communal  operation  should  be 
supplemented  by  a  form  for  getting  information  from 
all  or  part  of  the  households  in  the  cooperative. 

5.4  Use  of  "villages"  or  "barrios"  as  a  frame 

of  PSU' s 

In  many  countries  the  smallest  administrative 
unit  for  which  there  is  a  complete  list  (and  for 
which  there  is  likely  to  be  data  avail- 
able which  can  be  used  in  designing  the 
sample)  is  a  group  of  inhabitants  with 
some  tribal  or  ethnic  ties  (known  as  a 
"village"  in  some  Asian  and  African 
countries)  or  as  a  minor  civil  division 
(such  as  a  "barrio")  in  some  Latin 
American  countries.  The  unit  may  con- 
tain a  concentration  of  households  sur- 
rounded by  more  scattered  homes .  Such 
administrative  units  are  similar  to  the 
"chief doms"  of  Agrostan' s  Northern 
Territory.  Since  they  are  likely  to  be 
quite  variable  in  population  size,  the 
sampling  method  used  in  the  Northern 
Territory  or  the  method  of  sampling  with 
probability  more  exactly  proportional 
to  size,  as  described  below,  may  be  used. 

If  these  units  do  not  comprise  a 
frame  covering  all  possible  holdings  in 
the  population,  procedures  must  be  devised  to  se- 
lect a  sample  of  those  which  are  not  included . 

5.5  Sampling  with  probability  proportional  to 

size 

In  Agrostan  it  was  assumed  that  information 
was  available  on  the  size  of  PSU's,  which  to  some 
extent  reflected  the  numbers  of  holdings .  Since 


actual  information  on  numbers  of  holdings  was  not 
available,  population  was  used.   It  was  assumed 
that,  within  the  agricultural  universe  (which 
excluded  densely  populated  city  areas)  population 
would  be  correlated  with  number  of  holdings .   In 
selecting  PSU's  from  cumulated  measures  of  size 
assigned  to  EA's  in  the  Provinces,  we  used  a  se- 
lection probability  roughly  proportional  to  the 
size  of  EA's  as  a  device  to  reduce  map  work.   In 
the  Northern  Territory,  where  PSU's  were  more 
variable  in  size,  they  were  classified  in  several 
size  groups,  and  the  probabilities  of  selection 
were  more  closely  related  to  size.  Still  another 
procedure  is  to  use  probability  proportional  to 
a  measure  of  size  (PPS).  The  difference  between 
these  methods  is  shown  in  the  following  hypothe- 
tical example . 
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Probabilities  giving  20  sample  PSU's  from  the  total  of  80. 


A  self -weighting  sample  is  obtained  in  all  three 
cases,  by  using  at  the  second -stage  a  probability 
(sampling  rate)  which  is  the  inverse  of  that  used 
at  the  first-stage,  thus  giving  a  constant  overall 
probability.  The  advantage  of  using  PPS  is  that 
(if  the  population  count  is  highly  correlated  with 
the  number  of  agricultural  operations)  it  results 
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in  a  more  nearly  equal  workload  of  interviews  (num- 
ber of  sample  holdings)  for  each  sample  PSU,  and  usu- 
ally leads  to  a  smaller  variance  in  the  estimates . 
If  more  than  one  unit  is  selected  from  an  en- 
tire stratum  with  PPS  and  without  replacement  the 
second  and  succeeding  units  are  biased  toward  small- 
er units  (since  larger  units  are  likely  to  be  selec- 
ted first  and  do  not  have  their  proper  probability 
in  later  selections ) .  There  are  several  methods  of 
avoiding  this  bias.  A  simple  one  consists  in  se- 
lecting systematically  with  PPS.   In  the  hypothe- 
tical example  above,  the  interval  for  systematic 
selection  would  be  — ^ —  =  2,500.  A  random  num- 
ber from  0001  to  2500  is  the  starting  number  and 
it  selects  the  first  PSU  for  which  the  cumulated 
population  total  equals  or  exceeds  the  random 
starting  number.  Successive  sample  PSU's  are  iden- 
tified by  repeatedly  adding  the  interval  2500  to 
the  random  starting  number  and  finding  the  selec- 
ted PSU  from  the  cumulated  population  number.5 


The  procedure  is  discussed  and  illustrated  in  Lecture  10 
of  the  following  pub] ication  of  the  U.S.  Bureau  of  the  Cen- 
sus.  Supplemental  Courses  for  C&se  Studies  in  Surveys  and 
Censuses,  Sampling  Lectures,  ISP  Supplemental  Course  Series, 
No.  1,  Washington,  D.C. ,  1968. 


5.6   Complete  agriculture  census  to  be  taken 

Even  when  a  complete  (or  nearly  complete) 
agriculture  census  is  to  be  taken,  some  of  the 
principles  of  sample  design  discussed  above  can  be 
used  for  one  or  more  of  the  following  purposes : 

(1)  To  extend  the  scope  of  the  census  (see  sec- 
tion 2.3  above) 

(2)  To  specify  a  sample  for  early  tabulation 
(see  section  4.71  above) 

(3)  To  study  the  "enumerator  effect"  (see  sec- 
tion 4.73  above) 

(4)  To  specify  a  sample  for  a  post-enumerative 
survey  to  check  on  the  quality  of  census 
data. 

(5)  To  specify  a  sample  of  small  holdings,  to 

be  taken  with  a  complete  census  of  all  larger 
holdings .  This  would  be  similar  to  the  Agro- 
stan  design  in  form,  but  the  definition  of 
certainty  holdings  would  have  a  much  lower 
cut-off,  for  example  10  hectares.  Holdings 

between  -r-n   hectare  and  10  hectares  would  be 

sampled,  and  those  of  over  10  hectares  in- 
cluded with  certainty. 

(6)  To  specify  a  sample  for  an  intercensal  sur- 
vey to  follow  the  census,  for  obtaining  cur- 
rent agricultural  estimates  - 


Chapter  IV-9.    OTHER  USES  OF  SAMPLING 


1.    INTRODUCTION 

The  use  of  sampling  in  the  field  enumeration 
activities  of  a  census  is  one  of  the  major  uses 
of  sampling.  However,  there  are  a  number  of  other 
areas  in  which  sampling  can  be  used  in  the  census 
program.  These  other  uses  include:   (a)  pretesting, 
(b)  quality  control  of  pre -enumeration  activities, 
field  enumeration,  and  processing,  and  (c)  the 
evaluation  program. 

2.    SAMPLING  IN  PRETESTING 

For  a  large  operation  such  as  a  census,  plans 
and  procedures  must  be  carefully  developed.  Word- 
ing of  the  questions  and  the  design  of  the  ques- 
tionnaire, as  well  as  the  check-in  procedures  and 
data -processing  operations,  should  be  pretested 
before  they  are  used  in  the  census .  Time  and  cost 
to  carry  out  a  function  also  must  be  analyzed  in 
the  pretests . 

Not  all  aspects  of  pretesting  are  suited  to 
probability  sampling.  Pretesting  of  the  questions 
to  be  asked  may  require  a  different  selection  of 
respondents  than  that  needed  for  obtaining  data 
on  the  time  and  cost  of  performing  the  enumeration. 
Likewise,  an  entirely  different  sample  may  be 
appropriate  for  pretesting  an  office  operation. 
Properly  designed  samples  in  the  pretest  may  assist 
in  making  the  most  suitable  decisions . 

2.1    Pretesting  questionnaires  and  other  forms 

After  a  decision  has  been  made  as  to  the  in- 
formation wanted  from  the  census,  it  is  necessary 
to  determine  the  order  of  asking  the  questions, 
and  the  wording  which  will  obtain  the  answer  most 
easily  and  most  correctly.  Normally  a  proposal 
is  made  after  consultation  between  the  data  users 
and  the  subject  matter  specialists  in  the  NSO. 


The  proposal  may  involve  several  alternative  ver- 
sions of  the  questionnaire  with  different  question 
wording.  Several  pretests  may  be  made  to  test 
different  groups  of  questions  and  to  make  succes- 
sive changes  in  questions .  Some  of  this  pretesting 
may  be  done  first  in  the  NSO  office. 

It  is  then  necessary  to  test  the  question- 
naire (or  the  several  versions  of  it)  in  an  actual 
interview  situation.  One  relatively  inexpensive 
method  is  to  test  one  or  two  topics  at  a  time  as  a 
supplement  to  the  current  household  sample  survey, 
which  is  being  conducted  in  Agrostan.   If  the  topic 
requires  several  questions  or  is  dependent  on 
another  topic,  the  related  questions  should  be 
pretested  as  a  group. 

Eventually  it  is  necessary  to  test  the  ques- 
tionnaire as  a  whole .  The  respondents  may  be 
designated  in  one  of  several  ways : 

(1)  Scientific  selection  of  a  sample  from  the 
general  population;  for  example,  a  sample  of 
EA's  throughout  a  Province,  or  a  sample  of 
holdings  in  a  district. 

(2)  Arbitrary  selection,  based  on  convenience; 
for  example,  persons  living  near  the  NSO 
office . 

(3)  Purposive  selection.  Respondents  may  be 
operators  of  holdings  that  are  identified 
as  having  a  particular  characteristic;  for 
example,  to  test  the  questions  on  production 
of  cotton,  the  holdings  might  be  those  which 
sent  cotton  to  the  various  processing  plants. 
Or  to  test  responses  on  tenure,  the  selected 
holdings  might  be  owners  listed  on  tax  rolls . 

The  method  of  scientific  selection  is  usually 
preferred  if  sufficient  time  and.  resources  are 
available  to  carry  out  the  pretest  successfully. 
If  the  sample  is  large  enough  to  yield  reasonably 
reliable  estimates,  data  can  be  tabulated  from  the 
pretest  questionnaires  and  analyzed  statistically. 
This  can  be  a  particularly  useful  method  for  test- 
ing alternative  wordings  for  certain  questions  or 
alternative  layouts  of  the  questionnaire.  The 
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sample  is  divided  randomly  into  several  subsamples, 
and  the  various  alternatives  are  assigned  at  ran- 
dom to  the  subsamples .   If  the  results  among  the 
groups  differ,  this  will  indicate  that  some  of  the 
alternatives  are  more  effective  than  others .  One 
may  not  know,  however,  which  alternative  is  best 
to  use  unless  some  independent  data  are  available 
with  which  the  sample  results  can  be  compared. 
Of  course,  one  must  not  attempt  to  test  too  many 
alternatives  at  a  time . 

Purposive  selection  may  be  desirable  in  order 
to  reflect  important  small  subgroups  of  the  popu- 
lation, such  as  persons  living  in  sparsely  settled 
areas,  persons  speaking  different  languages,  or 
persons  with  widely  differing  cultural  backgrounds . 

Arbitrary  selection  has  obvious  disadvantages 
in  that  the  results  may  be  biased;  however,  con- 
siderations of  money,  time,  and  personnel  may  lead 
to  this  choice .   It  will  usually  be  better  than 
making  decisions  in  the  NSO  with  no  field  pretest- 
ing at  all. 

The  testing  area  may  also  be  chosen  to  provide 
some  information  on  costs  of  travel  in  rural  areas, 
time  required  to  enumerate  a  holding,  etc . 

At  the  time  of  the  pretest  of  the  question- 
naire, a  decision  should  have  already  been  made  as 
to  the  size  and  type  of  enumeration  areas  to  be 
used,  functions  of  the  field  supervisor,  the  amount 
of  training  and  supervision  he  receives,  and  the 
amount  of  time  and  the  methods  he,  in  turn,  uses 
in  training  and  supervising  the  enumerators.  Pre- 
testing can  serve  to  point  up  major  problems  which 
may  require  changing  some  decisions  which  have 
already  been  made  for  field  operations. 

2.2   Pretesting  time  and  cost 

From  the  pretests  some  information  will  be 
obtained  on  the  time  it  takes  to  enumerate  the 
questionnaire,  the  time  to  travel  from  one  housing 
unit  to  the  next,  and  how  these  times  vary  from 
one  part  of  the  country  to  another.  This  type  of 
information  is  useful  in  deciding  the  basis  for 
paying  enumerators,  and  for  determining  whether 
they  are  receiving  a  reasonable  payment  for  the 


work  done.   It  also  assists  the  budget  office  in 
determining  how  much  the  field  work  for  the  census 
will  cost.   If  the  cost  is  too  high  relative  to  the 
total  budget,  alternative  procedures  will  have  to 
be  set  up  for  some  parts  of  the  program — field 
work,  office  work,  tabulation,  or  publication — 
or  more  money  may  have  to  be  obtained. 

2.3  Pretesting  office  procedures 

The  questionnaires  which  are  collected  during 
pretesting  can  be  used  to  decide  on  the  office  pro- 
cedures to  be  followed  in  processing  the  census . 
This  applies  to  all  phases  of  the  office  operations . 

2.31  Check-in  procedure .--The  forms  and  oper- 
ations required  for  receipt  and  control  of  the 
census  enumeration  (and  particularly  the  control 

of  the  census  sample)  should  also  be  tested.  Sev- 
eral forms  and  procedures  may  seem  equally  usable . 
An  office  test,  with  randomization  among  clerks, 
can  assist  in  deciding  which  process  is  most  likely 
to  be  successful. 

2.32  Office  processing. — Later  units  in  this 
case  study  describe  the  processing  operations  and 
quality  control  procedures  to  be  used.  The  deci- 
sion as  to  which  quality  control  procedure  to  use, 
and  how  to  apply  it,  can  be  made  after  running  some 
tests  on  the  results  collected  in  the  pretest  enu- 
meration. For  this  purpose,  the  testing  should  be 
done  on  a  random  basis — that  is,  a  probability 
sample  should  be  used,  with  work  assigned  to  clerks 
in  such  a  way  that  valid  distinctions  can  be  drawn 
from  the  results  of  the  tests . 

2.4  Experimental  census 

Before  taking  the  census  throughout  the  whole 
country,  it  should  be  conducted  in  an  area  (one  or 
more  districts)  of  moderate  size  and  with  varied 
characteristics,  so  that  all  aspects  of  the  census 
operation  can  be  tested .  Preferably,  this  is  done 
about  a  year  in  advance  of  the  census  date .  After 
that  date,  little  change  can  be  expected  to  be 
made  in  the  form  of  the  questionnaire,  in  the  field 
procedures,  or  in  the  office  processing.  Neverthe- 
less, it  is  desirable  to  carry  out  such  an  exercise 
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to  discover  if  there  are  airy  major  problems  which 
will  arise  in  the  conduct  of  the  census.  If  a 
serious  problem  is  encountered  at  any  stage,  steps 
can  still  be  taken  to  correct  it,  or  to  reduce  its 
effect. 

For  this  purpose,  the  area  in  which  the  ex- 
perimental census  will  be  conducted  is  usually 
not  selected  on  a  probability  basis .  The  training 
of  supervisors  and  enumerators  is  conducted,  ques- 
tionnaires are  developed,  the  census  is  enumerated, 
the  results  are  processed  (edited,  coded,  and 
tabulated),  and  a  final  publication  is  produced. 
Every  part  of  the  census  operation  is  followed 
through,  including  computation  of  variances  for 
items  which  are  enumerated  on  a  sample  basis . 

3.    SAMPLING  IN  QUALITY  CONTROL 
OPERATIONS 

The  conduct  of  a  census,  from  the  planning 
and  the  first  office  operations  through  the  publi- 
cation of  the  results,  requires  many  operations. 
Most  of  these  are  on  a  large  scale.  Some  are  so 
important  that  they  must  be  verified  completely; 
others  can  sometimes  be  verified  on  a  sample  basis. 
This  section  mentions  the  operations  in  which  sam- 
pling can  be  used  and  those  operations  which  di- 
rectly affect  the  sampling. 

3.1    Quality  control  in  pre -enumeration 
activities 

The  decisions  as  to  the  kind  of  census  to  be 
taken,  the  time  at  which  it  will  be  taken,  the 
questions  to  be  asked,  and  the  types  of  data  to  be 
tabulated  are  usually  either  required  by  law  or 
determined  by  a  group  of  high-level  personnel  in 
the  government  of  the  country,  sometimes  by  the 
chief  executive  himself.  After  the  general  form 
of  the  census  has  been  decided,  there  is  much 
preparatory  work  to  be  done.  A  few  of  the  major 
preparatory  operations  are  mentioned  below. 

3.11  Mapping . — If  possible,  a  map  of  each 
PSU  is  provided  for  the  enumerator.  The  usual 
procedure  is  to  delineate  the  FSU's  on  maps  for 
the  subdistricts  in  each  Province,  using  defin- 
able boundaries  such  as  roads,  rivers,  well- 
traveled  paths,  railroads,  tops  of  mountain  ridges, 


etc.  Hence,  the  identification  of  PSU's,  and  the 
mapping  work  which  goes  with  the  PSU's  should  be 
checked  100$;  there  should  be  as  little  likelihood 
of  error  in  the  maps  and  descriptions  of  the  PSU's 
as  is  humanly  possible. 

3.12  Format  of  questionnaires  and  listing 
books . — The  census  uses  a  number  of  questionnaires, 
listing  books,  and  other  forms,  both  in  the  field 
and  in  the  office .  The  questionnaires  particularly 
should  be  checked  thoroughly  by  several  persons 
before  printing,  to  be  sure  that  there  are  no  errors 
either  in  the  wording  of  questions,  the  location 

of  questions  on  the  form,  numbering  of  columns,  etc. 

The  persons  who  design  the  questionnaires 
should  try  to  make  them  as  easy  to  use  as  possible, 
both  in  the  field  enumeration  and  in  office  pro- 
cessing. Hence,  the  format  of  all  field-use  ques- 
tionnaires should  be  checked  100$,  preferably  by 
several  persons . 

3.13  Printing . — After  the  forms  have  been 
received  from  the  printer,  a  check  is  required  to 
see  that — 

(1)  The  printing  has  followed  the  format  speci- 
fied. 

(2)  The  printing  is  reasonably  uniform. 

(3)  The  pages  have  been  assembled  properly  in  the 
Listing  Book. 

These  and  similar  checks  can  be  made  on  a 
spot-check  basis;  no  scientific  sampling  is  usual- 
ly involved.  The  verifier  should  assure  himself 
that  the  printing  job  is  well  done,  but  a  very 
detailed  check  should  not  be  necessary. 

3.14-  Review  of  training  materials  and 
manuals. --The  materials  used  for  training  district 
supervisors  and  enumerators,  particularly  in  the 
sampling  aspects  of  the  census  program,  are  very 
important  and  should  receive  as  much  care  in  prep- 
aration as  the  questionnaires  themselves .  In  the 
same  way,  the  Enumerator's  Reference  Manual  should 
be  as  complete  and  free  from  errors  as  it  is  possi- 
ble to  make  it.  It  is  recommended  that  all  the  ma- 
terials be  reviewed. 

3.15  Distribution  of  materials. — The  Listing 
Books,  Agriculture  Questionnaires,  maps,  training 
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materials,  reference  manuals,  and  forms  for  con- 
trol of  the  field  operation  will  "be  distributed 
in  stages — first  to  the  PCO,  and  then  to  the  dis- 
trict supervisors,  who  will  give  the  materials 
for  individual  PSU's  to  his  enumerators.  Spot- 
checks  may  be  adequate  at  the  Province  level. 
However,  the  district  supervisor  must  check  all 
the  materials  very  carefully  to  be  sure  he  has  re- 
ceived all  the  materials  that  he  should  receive 
and  that  they  are  identified  properly. 

3.2  Quality  control  in  field  enumeration 

The  responsibilities  of  the  district  super- 
visor during  the  enumeration  include  controlling 
the  quality  of  work  of  the  enumerator,  insofar  as 
possible .  Some  of  these  tasks  must  be  performed 
on  a  100-percent  basis  and  some  can  be  reviewed 
on  a  sample  basis.  In  the  case  of  the  latter, 
the  sample  may  be  a  scientifically  selected  one 
for  some  purposes  and  a  spot-check  for  others . 

3.21  Checking  PSU  boundaries . — As  part  of 
his  training,  or  immediately  thereafter,  the  dis- 
trict supervisor  should  check  the  boundaries  of 
the  PSU's  which  are  to  be  assigned  to  his  enumer- 
ators .  He  should  do  this  for  every  sample  PSU 
so  as  to  avoid  having  any  enumerator  waste  time 
locating  the  boundaries  or  having  him  enumerate 
the  wrong  areas . 

3.22  Observation  of  enumeration. — As  early 
as  possible  after  the  start  of  enumeration,  the 
district  supervisor  should  visit  each  enumerator 
and  observe  him  while  he  is  listing  or  enumerating. 
The  district  supervisor  is  asked  to  check  some 
portions  of  the  enumerator's  work  entirely;  for 
other  portions,  he  is  asked  to  check  a  sample  of 
pages,  lines  or  items.  He  should  be  given  specific 
instructions  on  how  to  select  the  sample,  how  to 
record  his  findings,  and  when  to  take  remedial 
action. 

3.3  Quality  control  of  office  processing 

As  with  the  field  enumeration,  quality  con- 
trol should  be  an  integral  part  of  the  office 
operations.  The  amount  of  checking  which  is  re- 


quired will  depend  upon  the  operation. 

Some  of  the  checking  of  office  operations 
will  involve  100%  verification  and  others  will 
require  sample  verification.  In  still  other  cases, 
an  examination  of  summary  records  may  be  adequate; 
for  example,  a  comparison  of  the  number  of  NC 
holdings  that  are  tabulated  for  a  district  with 
the  number  on  the  NC  list  for  that  district;  if 
the  numbers  are  inconsistent,  further  checking  is 
obviously  required. 

3.31  Editing,  coding,  key  punching,  tabu- 
lating . — Sampling  is  used  extensively  in  the  quality 
control  of  data-processing  operations  to  assure 
high  quality  work  at  substantially  lower  cost  than 
if  all  the  work  is  verified.  In  some  cases,  the 
work  of  a  clerk  is  verified  100%,  then  reduced  to 
sample  verification  as  long  as  his  work  is  of 
acceptable  quality. 

3.32  Control  of  materials . — One  operation 
which  is  often  given  too  little  importance  in  the 
census  program  is  the  control  of  census  documents 
and  records.  The  task  of  distributing  and  receiving 
materials  involving  large  numbers  of  individuals 
through  several  levels  of  field  supervision,  the 
control  of  these  materials  to  and  from  various 
processing  operations  and  the  final  storing  and 
cataloguing  of  the  records  are  most  important  to 
the  success  of  the  census. 

3.33  Review  of  data. — Unit  IX  of  this  case 
study  discusses  the  review  and  evaluation  of  data 
collected  in  the  census.  The  chief  procedure  used 
is  a  check  for  consistency  of  results.  However, 
not  every  figure  which  is  published  can  be  examined; 
hence,  a  large  part  of  the  checking  (particularly 
for  small-area  data)  will  be  done  on  a  reduced 
basis.  Either  a  scientific  sample  of  areas  can 

be  checked,  or  the  checking  may  be  done  on  a  more 
informal  basis.  Larger  and  more  important  areas 
will  obviously  be  checked  more  thoroughly. 

4.    SAMPLING  IN  TABULATIONS 

Frequently  one  finds  that  the  census  is  cost- 
ing more  money  than  was  budgeted  for  it;  or  some 
national  emergency  arises  which  gives  the  census 
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low  priority.   In  such  a  case,  the  census  staff, 
including  the  sampling  technician,  should  be  pre- 
pared to  do  some  (if  not  all)  of  the  census  tabu- 
lations on  a  sample  basis . 

As  a  part  of  planning  for  the  census,  it  would 
be  desirable,  to  have  a  set  of  priorities  estab- 
lished, so  that  if  it  is  necessary  to  cut  back, 
it  can  be  done  in  an  intelligent  and  scientific 
way.  This  requires  a  knowledge  of  the  real  re- 
quirements for  the  census  data.  Which  are  abso- 
lutely essential?  Which  can  be  cut  back  to  the 
Province  level?  Which  need  be  produced  only  at 
the  national  level? 

An  analysis  of  the  real  uses  of  census  data 
is  a  very  difficult  undertaking,  and  one  can  never 
be  sure  he  is  setting  the  priorities  correctly. 
But  some  attention  to  this  problem  should  be  part 
of  the  early  planning  for  the  census,  so  that  if 
a  cutback  is  necessary,  it  can  at  least  be  an 
orderly  one . 

For  broad  policy-making  purposes  and  in  or- 
der to  have  an  overall  view  of  the  progress  of  a 
country,  there  are  some  tabulations  which  need  to 
be  made  only  at  the  national  level,  with  perhaps 
a  few  regional  breakdowns .  For  tabulations  such 
as  these,  it  is  not  necessary  to  process  all  of 
the  estimated  75,000  holdings  in  the  sample  in 
Agrostan. 

There  are  also  tabulations  of  a  very  special- 
ized type  for  which  local  data  would  be  so  de- 
tailed as  to  make  the  tabulations  impractical  to 
use .  Here  again  tabulation  of  the  entire  file  is 
not  necessary  for  large-area  tabulations. 

When  small -area  detail  is  not  required,  a 
sample  of  questionnaires  may  be  tabulated  to  pro- 
vide data  which  would  be  too  expensive  if  the 
whole  file  were  processed.   It  is  obvious  that 
such  a  sample,  with  a  much  smaller  set  of  ques- 
tionnaires to  be  tabulated,  will  cost  less  and 
save  time.  Another  possibility  would  be  to  tabu- 
late a  subsample  of  the  punched  cards . 


5.    SAMPLING  FOR  AN  E VAL UA TION  SURVEY 

No  matter  how  carefully  a  survey  is  planned, 
or  how  careful  and  dedicated  the  persons  who  carry 
it  out  are,  errors  are  always  made.  For  example, 
in  an  agricultural  census,  it  is  extremely  diffi- 
cult to  identify  each  holding  that  is  associated 
with  each  household  in  a  sample  PSU.  Generally 
some  are  missed. 

The  usual  practice  is  to  conduct  a  small- 
scale  sample  survey  as  soon  as  possible  after  the 
census  to  determine  how  well  the  census  enumerated 
the  holdings.  A  post-enumeration  survey  (evalu- 
ation survey)  is  intended  to  do  this. 

6.    SAMPLING  FOR  INTERCENSAL  SURVEYS 

Sampling  is  used  extensively  for  intercensal 
surveys  primarily  because  of  the  time  and  cost 
to  carry  out  a  census .  Since  intercensal  surveys 
are  concerned  with  current  information  on  a 
country-wide  basis  (with  possibly  some  regional 
or  urban-rural  breakdown),  a  well-designed  sample 
should  be  quite  adequate .  The  particular  sample 
design  would  depend  very  much  on  the  type  of  in- 
formation that  is  being  collected  and  the  source 
materials  that  are  available.  Listings  of  hold- 
ings from  the  previous  census  can  be  used  as  a 
frame,  or  the  sample  can  be  based  on  information 
known  about  the  entire  census  PSU.   In  the  case 
of  the  latter,  the  measure  of  size  can  be  used  for 
stratification  or  a  sample  of  the  census  PSU's  can 
be  selected  with  probability  proportional  to  size. 

As  years  pass,  and  one  gets  further  away  in 
time  from  the  date  of  the  census,  it  becomes  less 
useful  as  a  frame  for  sample  surveys.  After  a 
few  years,  the  size  of  a  PSU  may  be  quite  differ- 
ent from  its  size  on  the  census  date .  Also,  small 
agricultural  holdings  may  become  large  in  size,  and 
vice  versa.  However,  there  may  be  no  other  source 
which  is  as  good  as  the  census  and  it  may  be  nec- 
essary to  use  it  as  a  basis  for  the  sample  in  spite 
of  its  limitations.   In  any  case,  when  census  data 
are  used  as  a  frame,  provisions  must  be  made  to  up- 
date the  frame . 


Chapter  IV  -.10.  SUMMARY  OF  ACTIVITIES 


1.    INTRODUCTION 

The  sampling  technician  plays  a  central  role 
in  the  preparation  and  the  carrying  out  of  a  sam- 
ple census  of  agriculture.   However,  his  activi- 
ties must  be  closely  coordinated  with  those  of 
other  specialists  who  work  on  the  plans  and  in 
the  operation  of  the  Agriculture  Census.   It  is 
important,  therefore,  that  the  sampling  techni- 
cian know  what  activities  he  will  be  involved  in, 
and  that  he  completes  his  work  on  these  activi- 
ties according  to  schedule  so  that  the  work  of 
other  specialists  will  not  be  delayed. 

From  past  experience  on  sample  censuses  and 
surveys,  the  sampling  technician  will  be  aware  of 
the  general  categories  of  activities  in  which  he 
should  participate.  As  he  develops  his  role  under 
each  of  the  general  categories,  he  will  arrive 
at  specific  procedures  which  must  be  carried  out 
by  himself,  by  someone  under  his  supervision,  or 
by  some  cooperating  office.   In  this  chapter  we 
will  first  list  the  general  duties  of  the  sampling 
technician  and  then  summarize  the  specific  duties, 
particularly  activities  relating  to  the  enumeration 
phase  of  the  Agrostan  Sample  Census  of  Agriculture. 

To  some  extent,  the  list  of  activities  is 
given  in  approximate  chronological  order.   The 
sequence  of  events  will  not  always  be  clear,  how- 
ever, since  many  of  the  procedures  are  carried 
out  at  the  same  time. 

1.1    General  responsibilities  of  the  sampling 
technician 

The  various  specialists  involved  in  the  cen- 
sus such  as  the  subject-matter  specialists,  the 
geographer,  the  data  processing  specialist,  etc., 
may  ask  the  opinion  of  the  sampling  technician 
on  any  aspect  of  the  census  plans.  In  fact,  the 
sampling  technician  should  make  an  attempt  at 


gaining  a  general  understanding  of  census  opera- 
tions carried  out  by  other  specialists.  The  main 
responsibilities  of  the  sampling  technician  can, 
however,  be  grouped  into  a  few  broad  categories. 
These  categories  are  given  below. 

(1)  In  cooperation  with  other  members  of  the 
Agriculture  Census  Working  Group,  decide  on 
the  administrative  areas  for  which  estimates 
will  be  made  and  the  level  of  reliability  re- 
quired by  the  data  users;  also,  decide  on  the 
types  of  items  or  characteristics  which  will 
be  included  in  the  publication  program. 

(2)  Determine  what  materials  and  data  are  avail- 
able from  past  censuses  and  surveys  for  de- 
signing and  selecting  the  sample;  review  the 
sampling  methods  and  data  from  the  1961  Cen- 
sus of  Agriculture  and  from  Current  Agricul- 
ture Surveys;  check  the  availability  of  suit- 
able EA  maps  and  preliminary  data  from  the 
1970  Population  Census.  Review  the  table 
outlines  for  the  1971  Census  of  Agriculture; 
advise  on  the  amount  of  detail  in  the  table 
outlines  in  terms  of  reliability  of  the  es- 
timates. 

(3)  Prepare  the  sample  design  for  enumeration. 
Decide  on  the  frame  for  each  stage  of  sampling, 
enumeration  plan,  method  of  estimation,  and 
method  for  computing  variances. 

(4-)  Set  up  a  time  schedule  for  the  various  opera- 
tions of  sample  selection  so  that  coordination 
of  sampling  and  nonsampling  activities  can  be 
carried  out  (see  exhibit  IV-10-1). 

(5)  Prepare  detailed  instructions  for  each  opera- 
tion in  the  sample  selection;  design  the  work- 
sheets and  office  forms  required  to  prepare 
the  sampling  frames  and  select  the  sample 
units. 

(6)  Participate  in  the  design  of  the  Listing  Books, 
Agriculture  Questionnaires,  and  other  forms. 

(7)  Determine  sampling  procedures  for  (a)  pretest- 
ing, (b)  quality  control  of  pre-enume ration 
work,  field  enumeration,  and  data  processing, 
and  (c)  the  evaluation  program.  Participate 
in  these  activities  and  recommend  changes. 

(8)  Review  tabulated  results  for  statistical 
significance,  inconsistencies,  or  processing 
errors.  Assist  in  the  preparation  of  the 
final  report. 
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1,2   Summary  of  specific  sampling  procedures 
in  Agrostan 

Once  the  sample  design  is  decided  upon,  the 
specific  procedures  required  to  select  the  sam- 
ple, collect  the  data,  and  make  the  estimates 
can  be  developed.  The  list  given  below  contains 
the  specific  activities  related  to  sampling  which 
were  developed  for  the  Agrostan  Sample  Census  of 
Agriculture.  All  of  the  activities  are  not  en- 
tirely sampling  activities  and,  in  fact,  some 
may  involve  other  specialists.  However,  most  of 
the  activities  that  involve  the  sampling  techni- 
cian in  either  a  major  or  minor  role  are  included 
on  the  list. 

1.21  Provinces. — The  following  activities 
apply  to  the  operations  directly  or  indirectly 
related  to  sampling  operations  in  the  13  Provinces: 

(1)  Combine  small  districts  (those  with  fewer 
than  approximately  2,500  holdings)  either 
with  other  small  districts  or  with  contigu- 
ous large  districts.  Prepare  a  list  of  dis- 
tricts or  combinations  of  districts  for 
which  estimates  will  be  made. 

(2)  Assist  in  preparing  instructions  for  compiling 
the  list  of  National  Certainty  holdings  for 
each  district. 

(3)  Form  PSU's  from  the  1970  Population  Census 
EA's  (combining  small  EA's  and  subdividing 
very  large  ones)  and  assign  measures  of 
size. 

(4-)  As  required,  prepare  PSU  cards  to  facilitate 
stratification.  List  the  PSU's  on  the  PSU 
Sample  Selection  Sheet,  Form  A- 261  (exhibit 
IV-3-8)  in  the  order  they  appear  after  strat- 
ification.  If  no  stratification  is  needed, 
list  the  PSU's  in  numerical  order. 

(5)  Select  a  systematic  sample  of  PSU's  using 
the  same  sampling  rate  throughout  the  Pro- 
vinces. 

(6)  Cooperate  with  the  Geography  Division  to 
prepare  maps  for  the  sample  PSU's.  Desig- 
nate for  subdividing  those  EA's  which  have 
more  than  one  measure  of  size  (one  of  which 
is  a  sample  PSU) .  Assist  in  the  development 
of  a  numbering  system  for  the  sample  PSU's. 

(7)  In  cooperation  with  the  Agriculture  Division, 
design  the  Listing  Book  that  will  be  used  in 
preparing  the  second-stage  sampling  frame. 
Set  the  size  limits  for  the  Local  Certainty 
holdings  in  the  Provinces. 

(8)  Assist  in  preparing  instructions  for  listing 
holdings  in  the  Listing  Books.  Prepare 


instructions  for  the  subsampling  of  the 
holdings. 

(9)   In  cooperation  with  the  Field  Operations 
Division,  develop  the  enumeration  plan, 
paying  particular  attention  to  the  preven- 
tion of  bias  in  the  listing  and  subsampling 
operations. 

(10)  Assist  in  preparing  procedures  for  (a)  pretest- 
ing, (b)  quality  control  of  pre-enumeration 
activities,  observation  and  review  of  the  field 
enumeration,  post-enumeration  office  operations 
such  as  editing,  coding,  card  punching,  and 
tabulation,  and  (c)  the  evaluation  program. 

(11)  Develop  the  method  of  weighting  the  sample 
results  to  make  estimates  and  prepare  the 
necessary  instructions.  Review  the  noninter- 
views  and  recommend  action  to  keep  the  non- 
interview  rate  to  less  than  5%.     Prepare 
instructions  for  the  noninterview  adjustment. 

(12)  Review  the  procedure  used  to  check  in  the 
completed  Agriculture  Questionnaires  and 
Listing  Books. 

(13)  Specify  the  tabulation  needs  for  the  compu- 
tation of  estimates  of  variances. 

(14.)   Compute  variances  (using  the  form  exhibited 
in  chapter  7). 

(15)  Review  the  preliminary  tabulated  estimates 
for  reasonableness.   Consider  the  size  of 
the  estimates  obtained  and  their  variances. 
Look  for  errors  which  may  have  resulted  from 
improper  application  of  the  sampling  instruc- 
tions. Make  sure  the  amount  of  detail  re- 
quired by  the  tables  can  be  obtained  from 
the  sample  with  sufficient  accuracy. 

(16)  Review  the  analytic  and  descriptive  state- 
ments for  the  published  reports;  prepare  the 
section  on  the  description  of  the  sample  and 
sampling  error. 

1.22  Northern  Territory.— The  following 
activities  include  the  sampling  operations  applica- 
ble to  the  Northern  Territory: 

(1)  Assist  in  preparing  instructions  for  compiling 
the  list  of  National  Certainty  holdings  for 
each  zone. 

(2)  Make  a  list  of  chiefdoms  and  Special  EA's. 

(3)  Form  five  population  size  strata  by  zones; 
stratify  within  each  of  the  five  groups. 

(4.)  Prepare  PSU  cards,  as  needed  for  stratifi- 
cation. 

(5)  Select  a  sample  of  PSU's  using  a  different 
sampling  rate  in  each  size  group. 

(6)  Cooperate  with  the  Geography  Division  to  pre- 
pare maps  for  the  sample  PSU's.  Assist  in 
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the  development  of  a  numbering  system  for  the 
sample  PSU's. 

(7)  In  cooperation  with  the  Agriculture  Division, 
design  the  Listing  Book  that  will  be  used 

in  preparing  the  second-stage  and  .third- 
stage   sampling  frames. 

(8)  Assist  in  preparing  instructions  for  listing 
holdings  in  the  Listing  Book.  Prepare  in- 
structions for  the  subsampling  of  the  hold- 
ings for  the  Agriculture  Questionnaire  and 
for  area  measurement. 


(9)  Same  as  (9)  to  (16)  for  the  Provinces. 

1.3  Flow  chart  of  sampling  activities 

The  sampling  technician  may  wish  to  construct 
a  flow  chart  of  the  major  sampling  activities. 
Such  a  chart  will  help  him  to  schedule  his  work 
in  the  sequence  in  which  it  should  be  carried  out 
to  avoid  unnecessary  delays  in  any  particular  sam- 
pling operation.  Exhibit  IV-10-2  is  a  flow  chart 
for  Agrostan. 
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Exhibit  IV-2-1.    WORKSHEET,  FORM  P-270:    PRELIMINARY  COUNT  FROM  THE  1970  CENSUS  OF 
POPULATION  FOR  RANCHI  DISTRICT  IN  RAJ  PUR  PROVINCE 
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Exhibit  IV-2-2. 


PRELIMINARY  POPULATION  COUNT:   RURAL  ENUMERATION  AREAS  BY  SIZE  GROUPS, 
FOR  DISTRICTS  IN  RAJPUR  PROVINCE:    1970 
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17 
6 

9 

■a 

7 
2 

2 
2 

464 

531 

2 
6 

10,052 
31,221 

20 
65 

1 

1 
6 

3 
15 

6 
18 

7 
13 

i 

6 

1 
3 

4 

503 
480 

6 

33,948 

74 

4 

9 

17 

23 

13 

3 

2 

3 

459 

Kistna 

2 

7 
3 

15,578 

39,181 
26,754 

32 

81 
53 

3 
2 

2 

4 
3 

9 

24 

11 

9 

19 
16 

6 

16 
7 

3 

8 
9 

2 

4 
3 

1 

3 
2 

487 

Kolar 

484 

505 

6 

40,884 

79 

2 

4 

9 

23 

20 

13 

4 

4 

518 

8 

50,959 

97 

4 

3 

8 

27 

36 

10 

7 

2 

525 

-  Entry  represents  zero. 

Source:   1970  Census  of  Population. 


Exhibit  IV-2-3. 


PRELIMINARY  POPULATION  COUNT:    URBAN  ENUMERATION  AREAS  BY  SIZE  GROUPS, 
FOR  DISTRICTS  IN  RAJPUR  PROVINCE:    1970 


District 


Urban 
population, 
1  May  1970 


Number  of  urban  EA's  1970 


Total 


With  population- 


Under 
500 


500 
to 
599 


600 
to 
699 


700 
to 
799 


800 
to 
899 


900 
to 
999 


1,000 

or 
more 


Average 

number  of 

persons  per 

urban  EA 


Total 

Adoni 

Adjmer 

Baroda 

Bellary. . . . 

Bhopal 

Chanda 

Chirmuri. . . 
Dabhor 

Jaipur 

Kanpur 

Karur 

Kistna 

Kolar 

Poona 

Ranch! 

Warangol . . . 


164,914 

7,120 
99,480 
10,555 

5,994 

2,005 
2,814 
3,313 
1,327 

683 
3,917 
3,924 
1,428 

782 

648 

4,671 

16,253 


175 

7 

103 

12 

6 

2 
3 
4 
1 

1 
4 
4 
2 

1 

1 

5 

19 


13 


16 


24 


17 
3 


43 


33 
2 

1 


35 

2 
15 

4 
3 

1 
1 
1 


37 

5 

15 

2 


942 

1,017 
966 
880 
999 

1,002 
938 
828 

1,327 

683 
979 
981 

714 

782 
648 
934 
855 


-  Entry  represents  zero. 

Source:   1970  Census  of  Population. 


Exhibit  IV-2-4. 


Appendix 

PRELIMINARY  POPULATION  COUNT:    RURAL  ENUMERATION  AREAS  BY  SIZE  GROUPS, 
FOR  THE  PROVINCES:    1970 


77 


Province 


Total. 


Bolivar. 
Dewar. . , 
DuBois. , 
Hassam. . 


Liberto. . . . 

Lopez 

Lo-San-Tho. 
Morgan 


Paris. . . . 
Raj  pur . . . 

Rama 

Tali 

Valencia. 


Number 

of 

districts 


171 


12 

x19 
11 
12 

18 
16 
15 
15 
LC 


Number 
of   sub- 
districts 


1.093 


25 
52 
84 
51 

110 

235 

53 

:-'/■ 

46 

90 

47 

171 

44 


Rural 
population, 
1  May  1970 


6.090.387 


104,994 

277,015 
477,251 
245,867 

639,981 

1,243,892 

240,086 

492,982 

255,112 
520,087 
309,907 
1,026,749 
256,464 


Number  of  rural  EA's  1970 


Total 


11,826 


221 

595 
917 
482 

1,288 

2,064 

538 

980 

583 
1,057 

675 
1,870 

556 


With  population — 


Under 
200 


274 

16 
31 
24 
24 

16 
19 
22 
11 

32 
32 
L3 
15 

19 


200 
to 
299 


780 

31 
67 
69 

58 


38 
49 
69 

95 
64 
52 
61 
47 


300 

399 


I  ,".M 


32 
102 

134 
59 

247 
215 
H6 
165 

107 
175 
168 
233 
138 


400 

to 
499 


3.274 


42 
122 
236 

121 

352 
579 
179 

276 

157 
372 
191 
494 
153 


500 

to 
599 


2,715 


44 
167 
203 


292 

422 

69 

235 

109 
224 
177 
579 

108 


600 
to 
699 


1.490 


28 

75 
132 

63 

183 

310 

29 

HI 

53 

97 

39 

288 

52 


700 
to 
799 


839 

17 
20 
66 
38 

79 

274 

31 

59 

24 
55 
29 

116 

31 


800 

or 
more 


5_3_2 

11 
11 
53 
33 

39 

207 

13 

24 

6 
38 

6 
84 


Average 
number 
of  rural 
EA's  per 
district 


62 

28 
50 
65 
54 

107 

109 

49 

82 

32 
66 

45 

125 

56 


Average 
number  of 

persons 

per  rural 

EA 


515 

475 
466 
520 
510 

497 
603 
446 
503 

438 
492 
459 
549 
461 


Excludes  Calicut  District  which  is  entirely  urban. 
Source:  1970  Census  of  Population. 


Exhibit  IV -2-5. 


PRELIMINARY  POPULATION  COUNT:    URBAN  ENUMERATION  AREAS  BY  SIZE  GROUPS, 
FOR  THE  PROVINCES:    1970 


-  Entry  represents  zero. 

Source:  1970  Census  of  Population. 


Urban 
population, 
1  May  1970 

Number  of  urban  EA's  1970 

Total 

With  population — 

Average 

Province 

Under 
500 

500 
to 
599 

600 

to 

699 

700 

to 

799 

800 
to 
899 

900 

to 

999 

1,000 

or 
more 

persons  per 
urban  EA 

3.284.165 

3.110 

67 

1 

10 
3 

q 

12 

2 

4 

1 

7 

2 

16 

100 

2 

11 
6 

18 

19 

3 

5 

3 
13 

5 
14 

1 

2Z7 

3 

7 
28 
27 

42 

148 

6 

16 

5 
16 

6 
32 

1 

659 

4 
12 
31 
36 

51 

320 

11 

31 

8 
24 
28 

101 
2 

859 

10 
19 
57 
39 

83 

340 

22 

48 

7 

43 

32 

133 

26 

651 

14 
16 
69 
41 

68 

171 

19 

62 

8 

35 

18 

103 

27 

437 

8 
12 
58 
38 

66 
98 
11 
44 

6 
37 
14 
29 
16 

1.056 

45,998 

67,977 

290,337 

207,474 

334,810 

1,182,585 

75,900 

216,249 

37,871 
164,914 
104,585 
476,022 

79,443 

42 
66 

264 
190 

337 

1,108 

74 

210 

38 
175 
105 
428 

73 

1,095 
1,030 
1,100 

1,092 

993 

1,067 
1,026 
1,030 

997 

Tali 

942 

996 
1,112 

1,088 

78 


Appendix 

Exhibit  IV -2-6.    ESTIMATED  SAMPLE  SIZE,  FOR  THE  PROVINCES 


Province 


Population,  1  May  1970  (preliminary  count) 


Total  for 

agriculture 

census 

(I) 


Rural 


(2) 


Small 
urban 
places 

(3) 


Outskirts 

of  large 

urban 

places 

(4) 


PSU': 


Total 


(5) 


In 

sample 

(6) 


Housing  units 
(estimated  number) 


Total 


(7) 


Listings 

in  sample 

PSU's 

(8) 


Provinces,  total 

Bolivar 

Dewar 

DuBois 

Hassam 

Liberto 

Lopez 

Lo-San-Tho 

Morgan 

Paris 

Raj  pur 

Rama 

Tali 

Valencia 


6,74.7,729 

135,477 
313,778 
541,4.32 
286,34-6 

691,781 

1,381,114 

258,317 

528,833 

286,014 
563,123 
362,480 
1,115,901 
283,133 


6,090,387 


104,994 
277,015 
477,251 
245,867 

639,981 

1,243,892 

240,086 

492,982 

255,112 
520,087 
309,907 
1,026,749 
256,464 


413,340 

27,745 
28,960 
42,475 
23,453 

31,745 

61,714 

5,398 

14,558 

29,160 
31,772 
41,901 
59,058 
15,401 


244,002 


11,948 


y  m 


2,738 

7,803 

21,706 

17,026 

20,055 
75,508 
12,833 
21,293 

1,742 
11,264 
10,672 
30,094 
11,268 


248 
542 
916 
531 

1,231 

2,345 

498 

951 

546 
1,048 

670 
1,889 

533 


62 
136 
229 
133 

308 
586 
125 
238 

137 
264 
168 
472 
133 


1,204,210 

19,640 
46,840 
94,305 
41,500 

135,225 

276,22-5 

39,630 

99,720 

42,145 
89,385 
55,030 

224,115 
40,450 


301,055 

4,910 
11,710 

23,575 
10,375 

33,805 

69,055 

9,910 

24,930 

10,535 
22,350 
13,760 
56,030 
10,110 


All  holdings  within  scope  of  agriculture 
census  (estimated  number) 


Holdings  in  sample  (estimated  number) 


Total 
(9) 


Provinces,  total 

Bolivar 

Dewar 

DuBois 

Hassam 

Liberto 

Lopez 

Lo-San-Tho 

Morgan 

Paris 

Raj pur 

Rama 

Tali 

Valencia 


1,008,970 

17,855 
40,730 
78,585 

34,645 

110,615 

226,860 

36,530 

83,100 

38,815 
77,855 
47,990 
180,890 
34,500 


Small 
(10) 


977,490 

16,780 
38,510 
76,240 
33,005 

108,850 

223,600 

33,585 

80,465 

36,935 
73,935 
45,930 
176,935 
32,720 


Local 
Certainty 

(11) 


27,  i 


955 
2,160 
1,965 
1,400 

1,600 
2,240 
2,725 
2,475 

1,680 
3,520 
1,960 
3,595 

1,605 


National 
Certainty 

(12) 


Total 
(13) 


3,600 


71,680 


120 

60 

380 

240 

165 

1,020 

220 

160 

200 
400 
100 
360 
175 


1,410 
3,010 
5,635 
2,650 

7,360 

15,560 

3,005 

5,810 

2,940 
5,905 
3,460 
12,315 
2,620 


Small 
(14) 


61,100 


1,050 
2,410 
4,765 
2,060 

6,795 

13,930 

2,105 

5,030 

2,320 
4,620 
2,870 
11,05.0 
2,045 


Local 
Certainty 

(15) 


6,980 


240 
540 
490 
350 

400 
560 
680 
620 

420 
885 
490 
905 
400 


National 
Certainty 

(16) 


3,600 

120 

60 

380 

240 

165 

1,020 

220 

160 

200 
400 
100 
360 
175 


Column  1.   Sum  of  columns  2,  3,  and  4. 

Column  2.  All  EA' s  classified  as  rural. 

Column  3.   Urban  places  under  5,000  population. 

Column  4.   EA's  outside  the  built-up  area  of  urban  places  with  popu- 
lation of  5,000  and  over.   Delineation  of  such  EA's  is  based  on 
judgment  and  knowledge  of  the  area. 

Column  5.   Prirrw.T-y  sampling  units  established  for  the  1971  Sample 
Census  of  Agriculture;  roughly  equivalent  to  EA's  in  the  Popula- 
tion Census  except  that  small  EA's  are  combined  and  large  EA's 
are  subdivided. 

Column  6.   One-fourth  of  the  total  PSU's. 

Column  V.  Estimates  based  on  the  1960  average  number  of  persons  per 
housing  unit;  by  Province,  this  average  ranges  from  approximately 
5.0  to  7.0  persons. 

Column  8.   One-fourth  of  the  total  housing  units  that  are  in  the 
sample  PSU'  s . 

Column  9.   Estimates  of  holdings  in  1971  assume  the  same  relation- 
ship between  listings  and  holdings  that  was  found  in  the  1961 


Sample  Census  of  Agriculture,  with  adjustment  for  difference  in 

size  of  holdings  considered  out-of-scope.   By  Province,  this 

relationship  ranges  from  about  80  percent  to  92  percent. 
Column  10.  Holdings  between  l/lO  hectare  and  Local  Certainty  level. 
Column  11.  Holdings  50  to  499  hectares  in  Liberto,  Lopez,  Morgan, 

and  Tali  Provinces;  100  to  499  hectares  in  Bolivar,  DuBois, 

Hassam,  Lo-San-Tho,  and  Paris  Provinces;  and  200  to  499  hectares 

in  Dewar,  Rajpur,  Bama,  and  Valencia  Provinces. 
Column  12.  Holdings  of  500  hectares  or  more  or  having  specialized 

agricultural  operations. 
Column  13.  Sum  of  columns  14,  15,  and  16;  total  number  of  A-l 

questionnaires  to  be  filled. 
Column  14.  One-sixteenth  of  the  estimate  in  column  10  (one-fourth 

of  the  small  holdings  within  the  sample  PSU's). 
Column  15.  One-fourth  of  the  estimate  in  column  11  (all  the  Local 

Certainty  holdings  in  the  sample  PSU's). 
Column  16.  Same  as  figure  in  column  12;  all  National  Certainty 

holdings  are  to  be  enumerated. 


Appendix 

Exhibit  IV-2-7.    ESTIMATED  SAMPLE  SIZE,  FOR  RAJPUR  PROVINCE 
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District 


Population,  1  May  1970  (preliminary  counts) 


Total  for 

agriculture 

census 

(1) 


Rural 


Small 
urban 
places 

(3) 


Outskirts 

of  large 

urban 

places 

(4) 


PSU'S 


Total 


(5) 


In 
sample 

(6) 


Housing  units 
(estimated  number) 


Total 


(7) 


Listings 

in  sample 

PSU's 

(8) 


Total.. 

Adoni-Bellary 

Ajmer 

Baroda 

Bhophal 

Chanda 

Chirmuri 

Dabhor-Poona. 
Jaipur-Kistna 

Kanpur 

Karur 

Kolar 

Ranchi 

Warangol 


563,123 


48,780 
54,766 
46,618 
32,128 

47,749 
53,002 
40,414 

27,741 

35,138 
37,872 
39,963 
45,555 
53,397 


520,087 

46,157 
47,445 
a,  476 
30,123 

44,935 
49,689 
38,439 
25,630 

31,221 
33,948 
39,181 
40,884 
50,959 


31,772 


11,264 


1,048 


2,471 
3,789 

2,005 

2,814 
3,313 
1,975 
2,111 

3,917 

3,924 

782 

4,671 


2,623 
4,850 
1,353 


2,438 


87 
•)6 
88 
69 

86 
92 
78 

60 

68 
73 

74 
83 

r*4 


264 

22 

24 
22 
L7 

22 
23 
20 
15 

17 
18 
19 
21 
24 


89,385 

7,740 
8,695 
7,400 
5,100 

7,580 
8,415 
6,415 
4,405 

5,575 
6,010 
6,345 
7,230 
8,475 


22,350 

1,935 
2,170 
1,850 
1,275 

1,895 
2,105 
1,605 
1,100 

1,395 
1,505 
1,585 
1,810 
2,120 


All  holdings  within  scope  of  agriculture 
census  (estimated  number) 


Holdings  in  sample  (estimated  number) 


Total 
(9) 


Total.. 

Adoni-Bellary 

Ajmer 

Baroda 

Bhophal 

Chanda 

Chirmuri 

Dabhor-Poona. 
Jaipur-Kistna 

Kanpur 

Karur 

Kolar 

Ranchi 

Warangol 


77,85; 


6,745 
7,570 
6,445 
4,440 

6,600 
7,330 
5,585 
3,835 

4,860 
5,235 
5,525 
6,300 
7,385 


Small 
(10) 


73,935 


6,205 
7,270 
6,070 
4,230 

6,350 
7,065 
5,135 
3,520 

4,680 
5,000 
5,320 
6,070 
7,020 


Local 
Certainty 

(U) 


3,520 

480 
290 
340 

190 

230 
240 
400 
260 

150 
220 
180 
210 
330 


National 
Certainty 

(12) 


Total 
(13) 


400 

60 
10 
35 
20 

20 

50 

55 

30 
15 
25 

20 

3C. 


5,905 

570 
540 
500 
335 

475 
525 
470 
340 

360 
380 
400 
455 
555 


Small 
(14) 


4,620 

390 
460 
380 
265 

395 
440 
320 
220 

290 
310 
330 
380 
440 


Local 
Certainty 

(15) 


885 

120 
70 
85 
50 

60 

60 

100 

65 

40 
'>c. 
45 
55 
80 


National 
Certainty 

(16) 


400 

60 
10 
35 
21 1 

20 
25 
50 

55 

30 
15 
25 
20 
35 


-  Entry  represents  zero. 

Column  1.   Sum  of  columns  2,  3,  and  4. 

Column  2.  All  EA.'  s  classified  as  rural. 

Column  3.   Urban  places  under  5,000  population. 

Column  4.   EA's  outside  the  built-up  area  of  urban  places  with,  popu- 
lation of  5,000  and  over.   Delineation  of  such  EA's  is  based  on 
judgment  and  knowledge  of  the  area,. 

Column  5.   Primary  sampling  units  established  for  the  1971  Sample 
Census  of  Agriculture;  roughly  equivalent  to  EA' s  in  the  Popula- 
tion Census  except  that  small  EA's  are  combined  and  large  EA'  s 
are  subdivided. 

Column  6.   One-fourth  of  the  total  PSU's. 

Column  7.   Estimates  based  on  the  1960  average  number  of  persons  per 
housing  unit  for  Bajpur  Province,  6.3  persons. 

Column  8.   One-fourth  of  the  total  housing  units  that  are  in  the 
sample  PSU'  s . 


Column  9.   Estimates  of  holdings  in  1971  assume  the  same  relationship 
between  listings  and  holdings  (about  88  percent)  that  was  found 
in  the  1961  Sample  Census  of  Agriculture,  with  adjustment  for  dif- 
ference in  size  of  holdings  out-of-scope  for  Bajpur  Province. 

Column  10.  Holdings  between  1/10  hectare  and  local  Certainty  level. 

Column  11.  Holdings  200  to  499  hectares. 

Column  12.  Holdings  of  500  hectares  or  more  or  having  specialized 
agricultural  operations. 

Column  13.  Sum  of  columns  14,  15,  and  16;  total  number  of  A-l 
questionnaires  to  be  filled. 

Column  14.  One-sixteenth  of  the  estimate  in  column  10  (one-fourth 
of  the  small  holdings  within  the  sample  PSU's). 

Column  15.  One-fourth  of  the  estimate  in  column  11  (all  the  Local 
Certainty  holdings  in  the  sample  ISU's). 

Column  16.  Same  as  figure  in  column  12;  all  National  Certainty 
holdings  are  to  be  enumerated. 
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Exhibit  IV-2-8.    SIMPLE  METHOD  FOR  OPTIMIZING  SAMPLE  DESIGN 

Once  the  cost  factors  have  been  determined,  an  approximately  optimum  sample  design  for  the  average 
district  can  be  found.  If  enough  data  on  costs  and  variability  were  available,  very  complicated  mathemat- 
ical methods  could  be  used  to  determine  the  sample  design.   Such  methods  are  not  likely  to  result  in  a 
very  different  design  for  the  Agrostan  Sample  Census  of  Agriculture,  however,  than  the  simple  method  which 
is  discussed  below. 

From  statistical  theory  we  know  that  the  coefficient  of  variation  (V) ,  used  to  compute  the  relative 
standard  error  of  a  proportion,  is  1  when  P  =  .50,  since  V  =  v/-j»  .  The  coefficient  of  variation  is  only 
about  1.5  when  P  is  as  small  as  .3.   For  larger  values  of  P  (larger  than  .50),  V  is  smaller.   Other  types 
of  estimates  such  as  averages  and  aggregates  will  have  much  larger  coefficients  of  variation  than  propor- 
tions, when  all  types  of  holdings  are  considered,  because  of  greater  differences  between  holdings.  How- 
ever, when  the  National  Certainty  holdings  are  placed  in  a  separate  universe,  the  coefficient  of  variation 
for  the  remaining  small  holdings  is  probably  close  to  1  for  the  most  important  characteristics.   Therefore, 
it  seems  reasonable  to  base  the  sample  size  for  the  average  district  on  characteristics  with  a  coefficient 
of  variation  of  1.   It  was  shown  in  section  5.1  of  chapter  2  (in  Unit  IV  of  the  case  study)  that  the  sample 
size  needed  to  estimate  this  type  of  characteristic  with  a  coefficient  of  variation  of  5%   is  4-00  holdings 
when  simple  random  sampling  is  used. 

At  this  point,  some  estimate  must  be  made  of  the  effects  of  the  various  refinements  which  will  be  in- 
cluded in  the  design.  Let  us  assume  that  the  stratification  of  the  PSU's,  the  inclusion  of  Local  Certain- 
ty holdings  and  National  Certainty  holdings,  and  the  stratification  of  the  holdings  by  size  within  the 
sample  PSU's  will  result  in  the  required  sample  size  for  the  small  holdings  being  reduced  to  about  200  for 
simple  random  sampling.  With  cluster  sampling  this  number  will  be  increased  depending  on  the  intraclass 
correlation  factor  (8)  and  the  average  number  of  sample  units  per  sample  cluster  (n) . 

Both  6  and  ri  can  vary  over  a  relatively  large  range.  Moreover,  the  value  of  m  (the  number  of  PSU's 
required)  will  be  different  for  each  value  of  n  or  6  .   The  procedure  which  will  be  followed  here  is  to 
assume  a  value  of  6  and  then  to  consider  alternative  values  of  n  and  the  resulting  values  of  m,  together 
with  the  cost  factors  given  in  section  6.1.  The  values  of  m  and  n  which  provide  the  desired  reliability 
at  lowest  cost  will  determine  the  approximately  optimum  design  for  the  assumed  value  of  6  . 

For  values  of  6  which  are  lower  than  the  assumed  value,  the  sample  size  will  be  large  enough.  For 
values  of  6  that  are  larger  than  the  assumed  value,  it  is  probable  that  the  sample  size  will  be  too 
small  for  the  desired  reliability,  but  the  loss  in  reliability  will  likely  not  be  as  great  as  it  appears. 
With  high  values  of  6,  the  stratification  of  PSU's  is  more  effective  and  the  loss  in  reliability  from  the 
use  of  a  small  sample  size  is  partially  compensated.  Therefore,  if  we  assume  a  value  for  6  which  is  rep- 
resentative of  most  of  the  important  characteristics  for  district  estimates,  the  optimum  design  for  these 
characteristics  should  be  reasonably  appropriate  for  characteristics  with  smaller  or  larger  values  of  8. 

In  our  earlier  consideration  of  required  sample  size,  a  value  of  .05  for  6  was  assumed.   This  value 
should  be  quite  representative  of  many  characteristics  in  Agrostan,  when  the  average  size  of  the  PSU's  (N) 
is  about  80  holdings.  Let  us  then  use  6  =  .05  to  consider  alternative  sample  designs  for  the  Agrostan 
Sample  Census  of  Agriculture. 

The  following  table  gives  the  number  of  PSU's  (m)  and  total  variable  costs  for  different  average 
sizes  of  sample  within  the  PSU's  (different  values  for  n) .  The  table  also  shows  the  total  sample  size 
(mn)  required  in  each  case.  We  can  now  choose  the  sample  design  which  will  cost  the  least  while  providing 
the  desired  reliability  using  PSU's  with  intraclass  correlation  of  about  .05  for  these  characteristics. 
For  the  table,  the  reliability  is  assumed  to  be  the  same  for  each  sample  design  considered.  Alternatively, 
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Exhibit  IV -2-8.    SIMPLE  METHOD  FOR  OPTIMIZING  SAMPLE  DESIGN— Continued 

we  could  have  held  the  variable  cost  constant  and  considered  the  sampling  error  for  each  design;  the  pur- 
pose would  have  been  to  minimize  the  sampling  error  for  a  fixed  cost. 


TOTAL  VARIABLE  COSTS  FOR  ALTERNATIVE  TWO-STAGE  SAMPLE  DESIGN 
(When  6  is  .05,  Cx  is  $40,  and  C2  is  $2) 


Sample 
holdings 
per  PSU 

Number  of 
sample 

PSU's 

Total 

number 

of  sample 

holdings 

Total  variable 
costs 

(n) 

(m) 

(mn  ) 

(C1m  -  C  mn  ) 

1 

200 

200 

$8,400 

4 

57 

230 

2,740 

8 

34 

270 

1,900 

12 

26 

310 

1,660 

16 

22 

350 

1,580 

20 

19 

380 

1,520 

24 

18 

430 

1,580 

28 

17 

470 

1,620 

From  the  table  we  can  determine  the  optimum  sample  design  when  6  is  .05,  C  is  $40,  and  C  is  $2. 
When  n  is  1,  the  design  is  nearly  the  same  as  for  simple  random  sampling.  This  design  is  unquestionably 
inefficient  compared  to  a  design  with  some  clustering;  therefore  it  is  deleted  from  further  consideration. 

When  n  is  20,  the  number  of  sample  PSU's  required  is  19.  This  design  is  least  costly  of  all  the  al- 
ternatives.  (The  number  of  PSU's  for  the  design  was  computed  to  the  nearest  whole  PSU;  actually  the  de- 
sign requires  19.5  PSU's.)  If  390  holdings  are  selected  from  20  PSU's  instead  of  19,  the  cost  is  about 
the  same  as  the  designs  where  n  is  16  or  n  is  24.  Therefore,  all  three  of  these  designs  can  be  considered 
optimum  and  any  one  of  them  coul^  be  chosen  for  the  Agrostan  sample  design. 

The  design  where  n  is  20  and  m  is  20  was  chosen  because  the  selection  of  the  sample  in  this  design  is 
more  convenient  than  in  the  other  two.  The  average  number  of  PSU's  per  district  is  about  80  and  the  number 
of  holdings  within  the  PSU's  averages  about  80  to  85.   Thus,  a  l-in-4  sample  of  PSU's  from  each  district 
and  a  l-in-4  sample  of  holdings  from  each  sample  PSU  will  provide  enough  sample  holdings  and  result  in 
sampling  rates  (and  weights)  which  are  easy  to  apply.  This  is  an  important  consideration  when  the  sampling 
of  holdings  is  selected  by  field  personnel,  as  is  the  case  In  Agrostan.  The  misapplication  of  a  more  com- 
plex sampling  interval  could  easily  result  in  a  serious  bias. 

A  sample  of  20  holdings  from  each  of  20  PSU's  is  therefore  approximately  optimum  when  6  is  .05.  How 

well  Is  a  characteristic  with  6  -  .10  estimated  from  this  sample?  The  reader  will  recall  that  the  coeffi- 

a_,  ., 

cient  of  variation  VTi  of  an  estimate  is  —2.  and  its  square,  the  relative  variance  of  an 'estimate  is  V  "■  = 

X       8  -3 

With  the  assumptions  used  in  Agrostan,  Vxi 


_x'   .   With  the  assumptions  used  in  Agrostan,  Vxi  =  (.05)2.   This  was  determined  by  multiplying—^- for  a 

xs  —  x2 

simple  random  sample  by  a  factor  (1 +  6  (n  -  1)).  When  6  =  .05  and  n  =  20,  this  factor  is  1.95;  when  6  is  .1 

-  2  2  90  s 

and  n  is  20,  the  factor  is  2.90.   Thus,  the  V ~i  for  the  case  when  6  is  .1  is  ;  'AC  or  1.49  times  the  V-, 


1.95 


when  6  is  .05.   The  coefficient  of  variation  Vxi  when  6  is  .1  is  approximately  Vl.49  Vx<  or  1.22  V  i  when 
6  -  .05.  The  sample  of  400  holdings  selected  from  20  PSU's,  which  would  provide  estimates  of  characteris- 
tics with  a  Vxi  of  .05  when  6  =  .05,  would  provide  estimates  with  a  Vxi  =1.22  (.05)  or  .06  when  6  is  .10. 
We  can  conclude,  therefore,  that  a  moderate  mis judgment  in  the  size  of  the  intraclass  correlation  coeffi- 
cient will  not  normally  result  in  a  serious  loss  in  reliability. 

The  assumptions  of  a  coefficient  of  variation  V  =  1  and  an  intraclass  correlation  coefficient  6  =■  .05, 
were  chosen  for  conditions  which  exist  in  Agrostan.  In  many  countries,  one  or  both  of  these  factors  may  be 
much  larger;  if  so,  a  sample  larger  than  400  holdings  would  be  needed.  Alternatively  a  V  i  larger  than  .05 
for  the  important  characteristics  might  be  acceptable. 
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Exhibit  IV-3-1.    LIST  OF  1961  LARGE  HOLDINGS  (PROVINCES),  FORM  A-3,  FOR  RANCHI  DISTRICT 

IN  RAJ  PUR  PROVINCE 


FORM  A-3  (AGROSTAN) 
(Morch  1971) 


LIST  OF  1961  LARGE  HOLDINGS 
(Provinces) 


/C««-v^--«-ay 


7d^a^Kvo^->x-^ 


Code 

10 


Code 

IX 


N~ame 

of 

respondent 

(31 


holding 
(hectares) 


FOR  AGRICULTURAL  AGENT'S  USE 


Type  of  holding 
(crop,  livestock, 
:rop  and  livestock) 


National  Certainty  holding  in   19 
(6) 


(Ltr-o-'Cs.x 


9^3 


■^tA^o-iy 


□  No 


Explain  In 
column  (7) 


«?,  9  oi 


JC^rCot^rtAts 


□  No- 


Explain  In 
column  (7) 


X 


i<uuL  b^</^Y^ 


^   0  0O 


y\As\rZ4A^cAz^ 


□  Yes 
(^  No  - 


Explain  In 
column  (7) 


OUa^tJLcu   lO>-^JL 


^3  IS 


JCAsv-t&ZtrzJ'zs 


8" 

□  No 


■  Explain  In 
column  (7) 


A  loo 


'^-A/is-t^jLirT^b*' 


'      ]No 


'(7) 


H 


OJiu 


//*W 


s<S\sv^o/ 


SfYes 
□  No. 


Explain  In 
column  (7) 


12   1\ 


sL^sirteJ^orrJk' 


□  No  . 


Explain  In 
column  (7) 


-t^^Xd(  JU~  SieUnJh 


I^oS 


^C{^iy-6<>C£rTx*k^ 


Kf^e 
□  No 


Explain  In 
column  (7) 


?3i 


sXsuso-'Us 


□  No 


.2 


3 


L^o^JL^xur' 


S  LO 


-^A^a^ 


□  No 


Explain  In 
column  (7) 


A 


i 


l^niJtr^rLirkJ 


y^B 


/tVU<r42^ 


KJYe 
□  No 


•  Explain  In 
column  (7) 


Z 


s 


-zy ^*^A^y> 


i9i 


s<SL*rizS 


(3  Yes 

□  No- 


Explaln  In 
column  (7) 


<2 


SyufOtoMv 


I,     Z2.0 


sTLA^L/ 


Explain  In 
column  17) 


3 


3 


&Ud^C«J£ 


3,??9 


/CisV<^Cirdc, 


KlYes 
□  No. 


Explain  In 
column  (7) 


3 


(jJs'c-ou  T^i 


^d-^-hn^s 


2zew 


V,  Lbo 


y<-<sts<<oC<r*Az^ 


IS  Ye 
□  No 


3 


7 


J?*Z-^Jr-(-S~*is> 


CX-4>L-y<-yQj 


6>(,3. 


s^sxso-fo; 


□  No- 


Exploln  In 
column  (7) 


s 


{^<A^d^^^tiAJUO 


99o 


/Z/L^ri^ 


□  No- 


Explain  In 
column  (7) 


3 


L 


^*wcV 


7>ix^r' 


S?  I 


x/ul/ 


□  No. 


Explain  In 
column  (7) 


^ 


(ol2~ 


/<vui/ 


(S3  Yes 
□  No- 


Explain  In 
column  (7) 


V 


?SS 


JLsv<ye£Ar*-J*~ 


El  Ye 
□  No 


Exp/oln  In 
column  (7) 


V 


3 
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Exhibit  IV-3-1.    LIST  OF  1961  LARGE  HOLDINGS  (PROVINCES),  FORM  A-3,  FOR  RANCHI  DISTRICT 

IN  RAJ  PUR  PROVINCE— Continued 


SHEET aL^OF <■"■*»  SHEETS 


FORM  A-3  (AGROSTAN) 
(March  1971) 

NATIONAL    S 

I5TRY  OF   ECONOMY 
TATISTICAL,   OFFICE 

o.   Province  name                                                                                                     ;  Code 

LIST  OF  iy6l  LARGE  HOLDINGS 
(Provinces) 

b.  District  name           „                                                                                         .Code 

Ltm 
No. 

(1) 

Name 

of 
holding 

.(2) 

Name 

of 

respondent 

(3) 

Size 

Of 

holding 
(hectares) 

(4) 

Type  of  holding 
(crop,  livestock, 
crop  and  livestock) 

(5) 

FOR  AGRICULTURAL  AGENT'S  USE 

National  Certainty  holding  in   1971 
(6) 

Notes 
(7) 

(Mark  one) 

Form  A-2c 
Sheet      '     Line 

No            1        No. 

1 



1b^ 

Sgo 

/CnstrUs 

y  i  y 

□  No  -  Explain  In 
column  (7) 

2 

iRuL.  yU^t^ 

cC?  0 

/Ts^v^r' 

□  Yfi            -          m. 

—  !  - 

(?0  ^i^-ct^^je-c^ 

[2  No  -Explain  In 
^               column  f7) 

3 

□  No  -  Explain  In 
column  (7) 

4 

C]Ho-  Explain  in 
column  (7) 

5 

□  No-Exp/oinin 
column  (7) 

6 

pi  No  _  Explain  in 
column  (7) 

7 

□  No  -  Explain  in 
column  (7) 

8 

□  Yr-1                            > 

□  No  -Explain  in 
column  (7) 

9 

1 lTts                         "" 

□  No  -  Explain  in 
column  (7) 

10 



□  Yes                     »■ 
nN0-cEo?i°nn,7" 

11 

□  No  -  Explain  in 
column  (7) 

12 

1 1    rCL                                                  *" 

aNo-cEoTiTfS 

13 

□  No  -Explain  in 
*—*               column  (!) 

14 

□  No  -  Explain   in 
column  (7) 

15 

□  No  -  Explain  in 
column  (7) 

IE 

1 1  Yc                         > 

□  No  -  Explain  in 
column  (7) 

17 

□  No  -  Explain  in 
column  (7) 

18 

1 1  Yc-                         > 

□  No  -  Explain  in 

column  (7) 

19 

□  No  -  Explain  in 
column  (7) 

20 

1 |Ycs                       m 

□  No  -  Explain  In 
■— '               column  (7) 
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Exhibit  IV-3-2.    1971  NATIONAL  CERTAINTY  LISTING  (PROVINCES),  FORM  A-2c,  FOR  RANCHI 

DISTRICT  IN  RAJ  PUR  PROVINCE- 
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Exhibit  IV -3-2.    1971  NATIONAL  CERTAINTY  LISTING  (PROVINCES),  FORM  A-2c,  FOR  RANCHI 

DISTRICT  IN  RAJ  PUR  PROVINCE— Continued 
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Exhibit  IV-3-2.    1971  NATIONAL  CERTAINTY  LISTING  (PROVINCES),  FORM  A-2c,  FOR  RANCHI 

DISTRICT  IN  RAJ  PUR  PROVINCE— Continued 
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Exhibit  IV-3-2.    1971  NATIONAL  CERTAINTY  LISTING  (PROVINCES),  FORM  A-2c,  FOR  RANCHI 

DISTRICT  IN  RAJ  PUR  PROVINCE— Continued 
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Exhibit  IV-3-3.    PRELIMINARY  COUNT:  NUMBER  AND  POPULATION  OF  URBAN  PLACES,  BY  SIZE  OF 

PLACE,  FOR  THE  PROVINCES:    1970 


Sourne:     1970  Census  of  Population  for  the  Provinces. 


Exhibit  IV -3-4.    PSU  CARD  (PROVINCES),  FORM  A-260 


Size  group 

Urban  places 

Urban  population 

Number 

Percent 

Number 

Percent 

239 

29 

117 

53 

24 

3 

2 

1 

4 

6 

100.0 

12.1 

49.0 

22.2 

10.0 

1.3 

0.8 

0.4 

1.7 

2.5 

3,284.165 

100.0 

Urban  places  with  population — 
Under  2 , 000 

33,948 

379,392 

382,403 

313,734 

71,438 

71,741 

41,779 

302,191 

1,687,539 

1.0 

2,000  to  4,999 

11.5 

5,000  to  9,999 

11.6 

10,000  to  19,999 

9.6 

20,000  to  29,999 

2.2 

30,000  to  39,999 

2.2 

40,000  to  49,999 

1.3 

50,000  to  99,999 

9.2 

100 ,000  or  more 

51.4 

Form  A-260 

ACE0STAN                                    Nal 

(November   1970) 

MINISTRY  CF  ECONOMY 
/ional  Statistical  Office 

(a)    Sample  PSU  No. 

(b)    Card  No. 

PSU       CARD 
(Provinces) 

(c)   Province 

Code 

(d)    District 

Code 

(e)   Posted  by-                                        [Date 

1 

(f)    Subdistrict 

(g)   EA  No. 

1.   POPULATION: 

2.   MEASURE  OF   SIZE: 

5.  TYPE  OF  AGRICULTURE: 

3.   URBAN/RURAL: 

1  □  Urban             2  □  Rural          3  □  Remote 

6.   REMARKS: 

4.   ELEVATION: 

1  □  Lowland's      2IZI  Plateau      3  □  Highlands 

Appendix 

Exhibit  IV-3-5.    MAP  OF  CORDOBA  CITY  IN  WARANGOL  DISTRICT,  RAJPUR  PROVINCE 
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Exhibit  IV-3-6. 


MAP  OF  RANCHI  DISTRICT  IN  RAJPUR  PROVINCE  SHOWING  ENUMERATION  AREAS  FOR 
THE  1970  CENSUS  OF  POPULATION 
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Exhibit  IV-3-7.    DISTRICT  SUMMARY— PSU'S  IN  SAMPLE,  FORM  A-252,  FOR  RAJPUR  PROVINCE 


Form  A-252  (AGROSTAN)                                    ministry  of  economy 
(March  1971)                                           national  statistical  office 

A.  Province  or  zone    /)        .                                                          iCode    1 

DISTRICT  SUMMARY-PSU'S  IN  SAMPLE 
1971  CENSUS  OF  AGRICULTURE 

B.  Prepared  by         ,                                         1  C.  Date 

District 

Number 

of 
PSU's 

(3) 

Number 

of  NC 

holdings 

(4) 

Notes 
(5) 

Name 
(1) 

Code 
(2) 

fLcLtrT^ts"  lleJULa^^y. 

01 

23 

VX 

rly^Jp  J(rVi*+v<L*sts*..   'K«yw*v'  /vurC^JU*^/  £ifVLS  U  /% 

ox 
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1Sm\sa-tL()L^ 

03 

?  \ 

33 

ii^i/\p^tJL 
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XX 
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(  Jt^Aj^srv***AS\^LJ 

0(0 

XI 

2  X 

Psu     13  -be-  X^  y^Jl^U^^LU . 

tP^xJldh^ifXi  —   /   >fl-irK<a> 
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2t> 

S~X 

M/3,^t-^k>wv  ""    K^ot^aLy 

0% 

/& 

sy 

r\sCu^^\2***s\J 

Q°l 

in 

-*7 

1 

(0 

1 
is 

n 

A sO^t&sxJ 

II 

IV 

12 

7C*5L'-vv<-C-*^0 

IX 

xi 

23 

Pi  0  \as      1  &  ,6^*^(1-  -5.  0    ~fctr  J^  AsjijL^d+^ri^fiLtiL 

/pC&S~\  ^5K-*-i^-ZrOy 

13 

xjf 

3U 

|6   \)  jQs      I  X  y#st~tLs     1  7    A4T  -Ar*->a^-t-t^-tt^tA<-«^^«6' 
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Exhibit  IV-3-8.    PSU  SAMPLE  SELECTION  SHEET  (PROVINCES),  FORM  A-261  FOR  RANCHI  DISTRICT 

IN  RAJ  PUR  PROVINCE 


Form  A-261 

MINISTRY  OF  ECOHOH 

Shee 

1             of              / 

ACKOSTAN                                                                                                                               National  Statistical  urrioe 
(iferch  1971) 

PSU   SAMPLE   SELECTION  SHEET 
(Provinces) 

A.    Province                                      .(code) 

~~7?AjPu/e,                   /o 

B.    District                                        ((code)           ; 

C .    Posted  by-                                     ~\       .  _.                             [  Date 

Sub- 
district 
code 

(1) 

EA  No. 
(2) 

Urban/ 
rural 
code  a 

(3) 

1  opula- 
tion 

(4) 

Measure 

of 

size 

(5) 

To  be 
sub- 
divided2 

(6) 

Sample 
PSU 
No. 

(7) 

Sub- 

district 

code 

. 

EA  No. 

Urban/ 
rural 
code  1 

(3) 

Popula- 
tion 

(*) 

Measure 

of 
.size 

(5) 

To  be 

sub- 

divided 

C6) 

Sample 
PSU 
No. 

(71 

01 

OO  / 

1 

I3¥<Z 

JL 

oV 

a  €>?,  oop 

.2. 

i^"0 

/ 

OOo2- 

1 

G(pI 
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01 

oc9 

A. 

470 
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0  03 
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7C^L 

1 

c  /o 

*L 
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Exhibit  IV-4-1.    LISTING  BOOK  (PROVINCES),  FORM  A-2a,  FOR  PSU  19  IN  RANCHI  DISTRICT, 

RAJPUR  PROVINCE— Continued 
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xhibit  IV-5-1.    PRELIMINARY  COUNT:    CHIEFDOMS  BY  SIZE  GROUPS,  FOR  THE  NORTHERN  TERRITORY:    1970 


Number  of 

territorial 

districts 

Population, 
1  May  1970 

Number  of  chiefdoms 

Total 

With  population — 

Average 
number  of 

Zone 

Under 
100 

100 
to 
299 

300 
to 
599 

600 
to 
999 

1,000 
to 

3,499 

3,500 

or 
more 

persons  per 
chiefdom 

Total 

Atbara  River  Basin 

18 

8 
6 
4 

380,802 

158,584 

130,382 

91,836 

1,879 

776 
659 
444 

650 

297 
201 
152 

1,018 

363 
400 
255 

186 

103 
52 
31 

11 

6 
3 
2 

2 

4 
2 
3 

3 
1 
1 

203 

204 

198 

207 

Source:   1970  Census  of  Population. 


Exhibit  IV-5-2.    PRELIMINAR  Y  POPULATION  COUNT:    SPECIAL  ENUMERATION  AREAS  BY  SIZE  GROUPS, 

FOR  THE  NORTHERN  TERRITORY:    1970 


Population, 

1  May  1970 

Number  of  special  EA's 

Total 

V/ith  population — 

Average 
number  of 

Zone 

Under 
100 

100 

to 

299 

300 

to 

599 

600 

to 
999 

1,000 

to 
3,499 

3,500 

or 
more 

persons  per 
special  EA 

Total 

14,299 

7,449 
2,397 
4,453 

73 

37 
12 
24 

22 

13 
5 

39 

16 

4 

19 

12 

8 
3 

1 

- 

- 

- 

196 

Atbara  River  Basin 

201 

Northwest 

200 

Subic  Mountains 

186 

-  Entry  represents   zero. 

Source:      1970  Census   of  Population. 


Exhibit  IV-5-3.    PRELIMINAR  Y  POPULATION  COUNT:    URBAN  ENUMERATION  AREAS  BY  SIZE  GROUPS, 

FOR  THE  NORTHERN  TERRITORY:    1970 


Urban 
population, 

1  May  1970 

Number  of  urban  EA's  1970 

Total 

With  population — 

Average 
number  of 

Zone 

Under 
500 

500 
to 
599 

600 

to 

699 

700 

to 
799 

800 
to 
899 

900 
to 
999 

1,000 

or 

more 

persons  per 
urban  EA 

Total 

34,979 

26,825 
5,739 
2,415 

34 

25 

6 

3 

1 
1 

1 

1 

2 
2 

_2 
2 

10 

7 
3 

12 

9 
2 

1 

_6 

4 
1 

1 

1,029 

Atbara  River  Basin 

1,073 

Northwest 

957 

Subic  Mountains 

805 

-  Entry  represents  zero. 

Source:   1970  Census  of  Population. 
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Exhibit  IV -5-4.    ESTIMATED  SAMPLE  SIZE  EOR  THE  NORTHERN  TERRITORY 
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Population  size 
group 


Population,  1  May  1970 
(preliminary  count) 


Total  for 
agricul- 
ture 
census 

(1) 


Rural 


(2) 


Urban 


(3) 


PSU's 


Total 


U) 


Chief- 

doms  and 

Special 

EA's 

(5) 


Urban 


(6) 


PSU 
sampling 

rate 


(7) 


PSU's 

in 
sample 


(8) 


Popula- 
tion in 
sample 
PSU's 


(9) 


Housing 

units  in 

sample 

PSU's 

(estimated 

number) 

(10) 


Northern  Territory 
Atbara  River  Basin.... 

Less  than  300 

300  to  699 

700  to  1,499 

1,500  to  3,499 

3 , 500  or  more 

Northwest 

Less  than  300 

300  to  799 

800  to  999 

1,000  to  3,499 

3 , 500  or  more 

Subic  Mountains 

Less  than  300 

300  to  499 

500  to  1 , 299 

1,300  to  3,000 

3 , 000  or  more 


Northern  Territory. 

Atbara  River  Basin 

Less  than  300 

300  to  699 , 

700  to  1,499 

1,500  to  3,499 

3, 500  or  more 

Northwest , 

Less  than  300 

300  to  799 

800  to  999 , 

1,000  to  3,499 

3 , 500  or  more 

Subic  Mountains 

Less  than  300 

300  to  499 

500  to  1,299 

1,300  to  3,000 

3 ,  000  or  more 


407,279 


395,101 


12,178 


1,969 


1,952 


170,057 

93,196 

49,655 

6,967 

8,775 

11,464 

138,518 

100,099 

22,667 

5,394 

6,632 

3,726 

98,704 

73,258 

10,891 

4,565 

6,296 

3,694 


166,033 

93,196 

46,509 

6,089 

8,775 

11,464 

132,779 

100,099 

21,918 

1,726 

5,310 

3,726 

96,289 

73,258 

10,423 

2,618 

6,296 

3,694 


4,024 

3,146 
878 


5,739 

749 
3,668 
1,322 

2,415 

468 
1,947 


821 

689 

120 

6 

3 

3 

677 

610 

57 

6 

3 

1 

471 

430 

31 

6 

3 

1 


813 

689 

113 

5 

3 

3 

671 

610 

56 

2 

2 

1 

468 

430 

30 

4 

3 

1 


17 


(X) 

(X) 

1  in  24 

1  in  12 

1  in  6 

1  in  3 

all 

(X) 

1  in  24 

1  in  12 

1  in  6 

1  in  3 

all 

(X) 

1  in  24 

1  in  12 

1  in  6 

1  in  3 

all 


101 

44 

29 
10 

1 
1 
3 

33 

25 

5 

1 
1 

1 

24 

18 

3 

1 

1 
1 


46,485 


23,386 

3,983 
4,237 
1,089 
2,613 
11,464 

12,849 
4,062 
2,001 
852 
2,208 
3,726 

10,250 
3,056 
1,066 
743 
1,691 
3,694 


8,460 

4,255 

725 
770 
200 
475 
2,085 

2,340 
740 
365 
155 
400 
680 

1,865 
555 
195 
135 
310 
670 


All  holdings  in  sample 
PSU's  (estimated  number) 


Total 

(11) 


8,035 


4,045 
690 
730 
190 
455 

1,980 

2,220 
705 
350 
145 
380 
640 

1,770 
525 
185 
130 
295 
635 


Small 
(12) 


7,930 


3,985 
680 
720 
175 
440 

1,970 

2,190 
700 
340 
140 
375 
635 

1,755 
520 
185 
125 
290 
635 


National 
Certainty 

(13) 


Within 

PSU 

sampling 

rate 

(14) 


105 
60 

10 
10 
15 
15 

10 

30 

5 
10 
5 
5 
5 

15 
5 

5 
5 


(X) 

(X) 

1  in  2 

1  in  4 

1  in  8 

1  in  16 

1  in  48 

(X) 

1  in  2 

1  in  4 

1  in  8 

1  in  16 

1  in  48 

(X) 

1  in  2 

1  in  4 

1  in  8 

1  in  16 

1  in  48 


Overall 

sampling 

rate 


(15) 


1  in  48 


1  in  48 

1  in  48 
1  in  48 
1  in  48 
1  in  48 
in  48 


in  48 

in  48 
in  48 
in  48 
in  48 
in  48 


1  in  48 
1  in  48 
1  in  48 
1  in  48 
1  in  48 
1  in  48 


Holdings  in  sample 
(estimated  number) 


Total 
(16) 


1,565 


670 

350 

190 

35 

45 

50 

525 
355 
95 
25 
30 
20 

370 
265 
45 
20 
25 
15 


Small 
(17) 


1,460 


610 

340 

180 

20 

30 

40 

495 
350 

85 
20 
25 
15 

355 
260 
45 
15 
20 
15 


National 
Certainty 

(18) 


105 
60 

10 

10 
15 
15 

10 

30 

5 
10 
5 
5 
5 

15 
5 

5 
5 


Holdings 

in  sample 

for  area 

measurement 

(19) 


370 

155 
85 
45 
5 
10 
10 

125 

90 

20 

5 

5 

5 

90 

65 

10 

5 

5 

5 


(x)  Not  applicable.      -  Entry  represents  zero. 

Column  1.   Sum  of  columns  2  and  3. 

Column  2.   All  EA's  classified  as  rural. 

Column  3.  All  of  the  urban  EA's  in  Cebu  and  Cmbo  cities  and  8  rural- 
like  EA's  in  Zanzi. 

Column  4.   Sum  of  columns  5  and  6. 

Column  5.   Primary  sampling  units  established  for  the  1971  Sample 
Census  of  Agriculture;  equivalent  to  chiefdoms  and  Special  EA's 
in  the  Population  Census. 

Column  6.   Primary  sampling  units  established  for  the  1971  Sample 
Census  of  Agriculture;  equivalent  to  urban  EA's  in  the  Population 
Census  in  Cebu  and  Cmbo  cities  and  the  8  rural-like  EA's  in  Zanzi. 

Column  7.  Hate  used  to  sample  PSU's  within  each  population  group. 

Column  8.   Number  of  PSU's  selected  from  each  population  size  group. 

Column  9.   Total  population  in  the  selected  PSU's. 

Column  10.  Estimates  based  on  average  of  5.5  persons  per  housing  unit, 


Column  11.  Assumes  that  about  95  percent  of  the  housing  units  are 
associated  with  a  holding. 

Column  12.  Holdings  of  less  than  500  hectares. 

Column  13.  Holdings  of  500  hectares  or  more  or  having  specialized 
agricultural  operations. 

Column  14.  Rate  used  to  select  the  small  holdings  to  be  enumerated. 

Column  15.  The  overall  sampling  rate  for  the  Agriculture  Question- 
naire; equivalent  to  column  7  X  column  14. 

Column  16.  Sum  of  columns  17  and  18;  total  number  of  Agriculture 
Questionnaires  to  be  filled. 

Column  17.  Sample  of  small  holdings,  applying  the  sampling  rate  in 
column  14  to  the  estimate  in  column  12. 

Column  18.  Same  as  figure  in  column  8;  all  National  Certainty 
holdings  are  to  be  enumerated. 

Column  19.  One-fourth  of  the  estimate  in  column  17;  area  will  be  re- 
ported, but  not  measured,  for  all  National  Certainty  holdings. 
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Exhibit  IV-5-5.    ENUMERATOR'S  MAP  SKETCH  OF  DAMASCUS  CHIEFDOM  IN  THE  NORTHERN  TERRITORY 
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Exhibit  IV -5-6.    LISTING  BOOK  (NORTHERN  TERRITORY),  FORM  A-2b,  FOR  DAMASCUS  CHIEFDOM 

IN  THE  NORTHERN  TERRITORY 
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Exhibit  IV-5-6. 


LISTING  BOOK  (NORTHERN  TERRITORY),  FORM  A-2b,  FOR  DAMASCUS  CHIEFDOM 
IN  THE  NORTHERN  TERRITORY— Continued 
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Exhibit  IV -5-6.    LISTING  BOOK  (NORTHERN  TERRITORY),  FORM  A-2b,  FOR  DAMASCUS  CHIEF  DOM 

IN  THE  NORTHERN  TERRITORY— Continued 
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Exhibit  IV '-5-6.    LISTING  BOOK  (NORTHERN  TERRITORY),  FORM  A-2b,  FOR  DAMASCUS  CHIEFDOM 

IN  THE  NORTHERN  TERRITORY— Continued 
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Exhibit  IV -7-1.    WORKSHEET:    COMPUTATION  OF  VARIANCES— GROUPED-STRATUM  METHOD, 

FORM  A-281 

(For  use  in  computing  variances   for  total  Provinces,    individual  Provinces,    and  districts.      The   form  has  been  designed  to  accom-nodate  two 
characteristics — columns   2  to  4   for  the   first  and  columns  5  to  7   for  the  second.      Column  8  provides   for  variance  computations  for  ratios 
of  the  two  characteristics) 


Form  A-281 
A(3TOSTAN 
(June   1971) 


CO 


WORKSHEET 


IMPUTATION       OF       VARIANCES 
1971  CENSUS  OF  AGRICULTURE 


Sheet 


of 


(a)  Province  or  district 


Total   sample  PSU' s 


(b)  First  characteristic 

(c)  Second  characteristic 


(d)  Posted  by 


'Late 


First  characteristic 


Second  characteristic 


PSU  No. 
(1) 


Sample  total 
Un.  or  x2) 

(2) 


PSU  difference 
Ui   -  x2) 

(3) 


Square  of 
difference 
( c  oluran  3 ) 2 

(4) 


Sample  total 
(x1   or  x2) 

(5) 


PSU  difference 
Ui  -  x2) 

(6) 


Square  of 
difference 
(column  6)2 

(7) 


Covariance 

(column  3  x 

column  6) 


Total 


Weight 


Weighted 
estimates 


INSTRUCTIONS 

Variances  can  be  computed  separately  for  the  small  holdings  and 

the  Local  Certainty  Holdings.   The  total  variance  will  be  the 

sum  of  the  variances  from  both  sources. 
Items  B,  C — Selected  characteristics  for  which  variances  are  to 

be  computed.  Fill  columns  2,  3,  4  and  5,  6,  7  respectively  for 

these  characteristics. 
Columns  2,  5 — Xj  is  the  sample  total  for  the  first  in  a  pair  of 

PSU's;  Xg  is  the  sample  total  for  the  second  in  the  pair  of 


PSU' s  which  form  a  stratum.  For  purposes  of  computing  the  vari- 
ance, use  column  2  for  the  first  characteristic  and  column  6  for 
the  second  characteristic. 


Columns  3,  4  and  6,  7, 
heads. 


-Compute  as  instructed  in  the  column  box- 


Adjustment:  Results  on  last  line  of  columns  4,  7,  and  8  should  be 

adiusted  by  total  number  of  sample  PSU'  s  in  area . 

number  of  sample  PSU'  s  used  to  compute  variances 
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Exhibit  IV -10-1.    FLOW  CHART  OF  SAMPLING  OPERATIONS 


Determine  areas  of  publication  and  level  of  reliability 


Compile  list  of  National  Certainty  Holdings 


P  RO VINCES 


Form  PSU's  in  each  district  from  1970 
Population  Census  EA's 


NORTHERN      TERRITORY 


Prepare  list  of  chiefdoms  and  Special 
EA's  from  197  0  Population  Census 


Prepare  PSU  Cards  for  specific  districts 


Prepare  PSU  Cards  for  all  chiefdoms 
and  Special  EA's 


Stratification:  Stratify  PSU's  within  each 
district 


Stratification:  Form  five  population  size 
groups  for  each  zone;  stratify  within 
each  size  group 


First-stage  sampling:  Select  PSU's  with 
equal  probability 


First-stage  sampling:  Select  PSU's  with 
different  probability  in  each  size  group 


Listing:  List  all  holdings  in  sample  PSU 


Second-stage  sampling:  Select  sample  of 
small  holdings  listed  in  the  sample  PSU 
(sample  selected  by  supervisor);  designate 
all  Local  Certainty  Holdings  for  the  sample 


Second-stage  sampling:   Select  sample  of 
holdings  from  list  with  probability  in- 
versely proportionate  to  probability  of 
selection  of  PSU's  (sample  selected  by 
enumerator) 


Evaluation  program:  Select  sample  for 
Evaluation  Survey 


Third- stage  sampling:  From  sample  of 
holdings  in  second-stage,    select  a  sub- 
sample  of  holdings  for  measurement  of 
crop  area 


Interviewing:  Fill  Agricultural  Questionnaire  for  each  sample 
holding  and  each  National  Certainty  Holding 


Review  and  observe  enumerator's  work 


Compute  variances,    review  tabulated  data 
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Exhibit  IV -10-2.    CALENDAR  OF  SAMPLING  ACTIVITIES 

(Applies  to  Provinces  only;  for  the  Northern  Territory,  timing  and  content  should  be  adjusted  accordingly) 
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Exhibit  1-2-17.   MAP  LOCATION,  AREA,  URBAN  AND  RURAL  POPULATION,  AND  RURAL  POPULATION 

DENSITY,  FOR  DISTRICTS:    1960 


Map 
loca- 
tion 


District 
( zone ) 


Area 
(in 

km2) 


Population,  1960 


Total 


Urban 


Rural 


Rural 
density 


Map 
loca- 
tion 


District 
( zone ) 


Area 

(in 

km2) 


Population,  1960 


1\.:  'il 


L'b.'in 


Rural 


Rural 
density 


F-10 
G-12 
F-ll 
F-12 
G-ll 

F-ll 
G-ll 
F-10 


G-9 

G-8 
H-8 
G-8 
F-8 

G-9 
G-7 
G-8 
G-7 
H-8 
G-9 
G-8 


C-7 

E-9 
D-7 
E-9 

D-7 

E-8 
D-S 

D-7 
E-8 
E-9 

D-8 
D-8 
C-8 
E-9 


B-10 

C-9 

D-8 

C-9 

C-S 

C-9 
C-9 
C-9 
D-9 


F-6 

E-7 
E-6 
E-5 
E-6 

F-7 
E-6 
F-6 

E-6 


BOLIVAR 

Arcadia. . . . 
Barcelona. . 
Colorado. . . 

Monaco 

Nevada 

Orlando. . . . 
Palestina. . 
Quito 


DEWAR 

Angus 

Daruel 

Daviot 

Gleneig. . . . 
Hamilton. . . 

Latheron . . . 

Muthill 

New  Abbey. . 
Pitlochry . . 
Shleldaig. . 
Torridon. . . 
Woodside . . . 


DUBOIS 

Aurillac 

Bagnois 

Cahors 

Drome 

Fontainbleau . 

Gannat 

Gourdon 

Morillon 

Pontalier . . . . 
Riberac 

Rochef ort 

Sancerre 

Tarn 

Wassy 

HASSAM 

Burdur 

El  Feser 

Gandor 

Giza 

Hama 

Kharga 

Mersin 

Saghbub 

Shendi 


LIBERTO 

Atessa 

Cocina 

Fermo 

Larino 

Lucra 

Nicastro 

Osimo 

Pergola 

San  Giovanni . 


1,369 
2,828 
1,420 
3,217 
1,068 

898 
1,601 
1,102 


3,310 
2,027 
3,009 
3,84-2 
3,924 

3,135 
2,761 
2,157 
3,596 
1,884 
3,840 
2,517 


1,316 
1,551 
2,419 
1,872 
1,480 

2,720 
4,187 
1,453 
3,234 
1,508 

2,689 
2,294 
1,590 
1,730 


1,663 
1,834 
2,283 
2,184 
2,579 

3,515 
3,542 
2,283 
1,490 


1,334 
1,501 
1,509 
2,848 
1,477 

3,440 
2,756 
2,895 
1,442 


12,215 
17,077 
9,819 
23,391 
15,943 

25,390 
14,271 
10,187 


21,421 
14,356 
20,449 
48,884 
35,154 

15,506 
25,339 
17,999 
17,644 
14,357 
23,079 
12,896 


139,000 
24,142 
45,654 
21,225 
36,908 

24,306 
38,196 
51,723 
23,098 
31,363 

24,720 
28,973 
33,846 
26,143 


13,902 
53,850 
24,008 
88,491 

37,044 

33,252 
37,292 
14,924 
15,582 


65,434 
61,713 
69,532 
24,386 
194, 200 

72,612 

69,481 

43,953 

106,119 


3,104 
4,173 
2,315 
843 
5,740 

13,765 

943 

2,547 


4,871 
4,096 
4,964 
2,260 

1,150 

3,498 
9,597 
2,361 
1,135 
3,960 
5,071 
5,380 


82,256 
5,290 

10,044 
8,444 
6,118 

2,807 
2,344 
19,763 
2,327 
6,557 

2,468 
2,104 
8,705 
3,142 


2,783 
30,943 
14,427 
45,627 

2,165 

1,268 

10, 527 

3,603 

2,337 


8,220 
6,344 
2,962 
3,540 
100,207 

10,357 
3,078 
2,700 

45,138 


9,111 
12,904 

7,504 
22,548 
10,203 

11,625 

13,328 

7,640 


16,550 
10,260 
15,485 

46,624 
34,004 

12,008 
15,742 
15,638 
16,509 
10,397 
18,008 
7,516 


56,744 
18,852 
35,610 
12,781 
30,790 

21,499 
35,852 
31,960 
20,771 
24,806 

22,252 
26,869 
25,141 
23,001 


11,119 

22,907 

9,581 

42,864 

34,879 

31,984 
26,765 

11,321 
13,245 


57,214 
55,369 
66,570 
20,846 
93,993 

62,255 
66,403 
41,253 

60,981 


6.7 

4.6 
5.3 
7.0 
9.6 

12.9 
8.3 
6.9 


5.0 

5.1 

5.1 

12.1 

8.7 

3.8 
5.7 
7.2 
4.6 
5.5 
4.7 
3.0 


43.1 
12.2 
14.7 
6.8 
20.8 

7.9 
8.6 

22.0 
6.4 

16.4 

8.3 
11.7 
15.8 
13.3 


6.7 
12.5 

4.2 
19.6 
13.5 

9.1 
7.6 
5.0 
8.9 


42.9 
36.9 
44.1 
7.3 
63.6 

18.1 
24.1 
14.2 
42.3 


E-6 
F-5 
F-6 


E-7 
E-7 
E-9 
E-8 
E-7 

E-8 
F-8 
E-7 
F-8 
F-8 

F-7 
F-7 
E-7 

F-7 
E-7 

F-8 
E-7 
E-7 
F-8 
F-7 


G-10 
F-10 

F-10 
G-10 
G-9 

F-9 

F-10 

F-9 

F-9 

G-10 

G-10 


C-7 
D-7 
D-6 
D-7 
D-6 

C-7 
C-6 
D-6 
C-7 
D-6 
C-6 
C-6 


H-10 
G-12 
H-10 
H-10 
H-ll 
G-ll 
G-ll 


LIBERTO— Con. 

Semara 

Sesto 

Venessa 


LOPEZ 

Arizona 

Aurora 

Betances .... 

Bilbao 

Calicut 

Caribar 

Constitucion 

Daytona 

Elba 

Emela 

Fontana 

Galatea 

Murcia 

Ontario 

Pasadina. . . . 

Rulinda 

Salome 

Santa  Ana . . . 

Sorento 

Teruel 


L0-SAN-TH0 

Bangor 

Cochabamba . . 

Denizli 

Halab 

Jumba 

Kabul 

Larkana 

Managua 

Nigeria 

Ream 

Sumba 


MORGAN 

Aspatria 

Bowes 

Carnf orth .... 

Edmunds 

Glemf ord 

Greenwich .... 

Hindon 

Kidderminster 

Louth 

Matlocks 

Shattesbury. . 
Thirsk 


PARIS 

Alencon. . . 
Barbados . . 
Bourges . . . 

Chalet 

Chateau. . . 
Damasco. . . 
Felletin. . 


2,398 
3,501 
1,490 


1,490 
1,620 
3,677 
3,686 
159 

3,720 
2,019 
1,626 
1,060 
3,537 

2,418 
3,076 

3,574 
3,243 
1,237 

3,622 
1,665 
1,355 
1,938 
1,512 


2,970 
2,084 
1,688 
1,276 
2,861 

2,108 
2,701 
3,301 
2,793 
1,785 
1,853 


3,785 
3,291 
3,665 
1,592 
1,280 

1,813 
2,366 
4,372 
6,496 
4,888 
2,287 
1,832 


1,523 
2,046 
3,584 
1,707 
861 
2,485 
1,614 


85,889 
21,856 
45,351 


62,823 
29,812 
43,728 
63,097 
375,183 

46,942 
42,219 
50,382 
19,517 
39,515 

40,701 
99,384 
96,428 
46,582 

56,075 

40,693 

163,419 

326,712 

31,808 

31,588 


24,015 
18,607 
11,705 
19,649 
33,529 

20,089 
21,639 
42,454 
24,451 
18,307 
21,667 


93,120 
83,685 
75,609 
43,722 
29,192 

40,376 
24,613 
16,846 
11,761 
39,186 
9,510 
131,934 


7,737 
7,901 
31,200 
7,425 
10,464 
23,026 
12,243 


12,184 
3,158 
2,225 


7,469 
6,603 
5,672 
4,395 
375,183 

6,206 
6,555 
8,608 
4,581 
1,326 

2,567 
5,824 
7,071 
1,544 
5,379 

2,161 

24,845 

144,469 

2.576 
3,557 


5,312 
3,290 
625 
6,138 
8,231 

4,578 
418 
7,034 
2,230 
4,007 
5,164 


12,776 

6,427 
12,312 

8,143 
3,645 

1,360 
2,974 
1,582 
3,114 
4,715 
2,416 
64,734 


739 

422 
1,154 

648 
2,175 
1,169 

579 


73,705 
18,698 
43,126 


55,354 
23,209 
38,056 
58,702 

(X) 

40,736 
35,664 

41,774 
14,936 
38,189 

38,134 
93,560 
89,357 
45,038 
50,696 

38,532 

138,574 

182,243 

29,232 

28,031 


18,703 
15,317 
11,080 
13,511 
25,298 

15,511 
21,221 
35,420 
22,221 
14,300 
16,503 


80,344 
77,258 
63,297 
35,579 
25,547 

39,016 
21,639 
15,264 

8,647 
34,471 

7,094 
67,200 


6,998 

7,479 

30,046 

6,777 

8,289 

21,857 

11,664 


30.7 

5.3 

28.9 


37.2 

14.3 

10.3 

15.9 

(X) 

11.0 
17.7 
25.7 
14.1 
10.8 

15.8 
30.4 
.     . 
13.9 
41.0 

10.6 
83.2 
134.5 
15.1 
18.5 


6.3 

7.4 

6,6 

10.6 


7.4 
7.9 
10.7 
8.0 
8.0 


21.2 

23.5 
17.3 
22.3 
20.0 

21.5 
9.1 
3.5 
1.3 
7.1 
3.1 

36.7 


4.6 
3.7 
8.4 
4.0 
9.6 
8.3 
7.2 
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Exhibit  1-2-17.    MAP  LOCATION,  AREA,  URBAN  AND  RURAL  POPULATION,  AND  RURAL  POPULATION 

DENSITY,  FOR  DISTRICTS:    1960— Continued 


Map 
loca- 
tion 


District 
(zone) 


Area 


km" 


Population,  1960 


r  "ii 


Urban 


Rural 


Rural 
density 


Map 
loca- 
tion 


District 
(zone) 


Area 

(in 

km2) 


Population,  1960 


Total 


Urban 


Rural 


Rural 
density 


G-ll 
G-ll 
G-12 
G-ll 
H-10 

H-ll 
G-10 
G-12 
G-ll 
G-10 
G-10 


D-10 
C-10 
D-9 
D-ll 

F-10 

D-9 

D-10 

D-ll 

E-ll 

D-ll 

E-10 
E-10 
D-10 
D-ll 
D-10 
E-10 


H-9 

1-9 
H-8 
H-8 
1-8 

H-9 
H-9 
H-9 
H-10 

H-8 

H-9 
H-8 


PARIS— Con. 

Gisors 

Grecia 

La  Grand  Combe 

Moutier 

Nyons 

Prades 

Sao  Paulo 

Tarbes 

Troy 

Vallon 

Vaucluse 


RAJ  PUR 

Adoni 

Ajmer 

Baroda 

Bellary. . . . 
Bhopal 

Chanda 

Chirmuri. . . 

Dabhor 

Jaipur 

Kanpur 

Karur 

Kistna 

Kolar 

Poona 

Ranchi 

Warangol . . . 


RAMA 

Bogra 

Bundi 

Colombia. . 
Davengere. 
Karania. . . 

Malega. . . . 
Nagina. . . . 

Patan 

Rangpur. . . 
Sirsa 

Surat 

Tuni 


1,819 
1,001 
1,200 
2,031 

1,394 
1,586 
1,907 
1,535 
2,742 
1,810 


3,466 
2,134 
3,711 
3,198 
3,521 

3,276 
3,668 
4,204 
3,246 
3,718 

3,956 
3,258 
2,402 
5,577 

3,381 
3,667 


1,471 
2,064 
1,635 
1,743 
2,306 

2,512 
2,108 
3,568 
2,286 
2,065 

2,964 

1,081 


17,930 
13,980 
4,524 
10,805 
12,186 

10,588 
19,227 
16,783 
15,451 
20,731 
17,441 


21,452 
86,170 
39,926 
21,966 
25,713 

40,021 

40,294 

9,398 

8,672 

25,815 

28,912 
12,914 
28,501 
20,624 
34,669 
55,833 


24,512 
17,810 
28,893 
22,191 
18,734 

16,847 
11,573 
16,456 
18,611 
21,625 

27,864 

19,207 


5,192 
503 
832 

1,261 
488 

1,147 
4,327 
2,011 
1,248 
512 
607 


4,022 
51,298 

6,165 
3,446 
1,150 

2,147 

1,631 

723 

468 

2,297 

2,146 
883 
428 
771 

3,177 
11,036 


7,269 
6,332 
9,371 
6,425 
5,449 

5,979 
2,728 
611 
2,853 
3,617 

2,040 
6,492 


12,738 

13,477 

3,692 

9,544 

11,698 

9,441 
14,900 
14,772 
14,203 
20,219 
16,834 


17,430 
34,872 
33,761 
18,520 
24,563 

37,874 

38,663 

8,675 

8,204 

23,518 

26,766 
12,031 
28,073 
19,853 
31,492 
44,797 


17,243 
11,478 
19,522 
15,766 
13,285 

10,868 
8,845 
15,845 
15,758 
18,008 

25,824 
12,715 


14.4 
7.4 
3.7 
8.0 
5.8 

6.8 
9.4 
7.7 
9.3 
7.4 
9.3 


5.0 
16.3 
9.1 
5.8 
7.0 

11.6 

10.5 

2.1 

2.5 

6.3 

6.8 
3.7 

11.7 
3.6 
9.3 

12.2 


11.7 
5.6 

11.9 
9.0 
5.8 

4.3 
4.2 
4.4 
6.9 
8.7 

8.7 
11.8 


H-8 

H-8 
1-9 


B-7 

C-8 
B-9 
C-7 


B-7 
C-7 
A-8 
B-8 
C-8 

C-8 
B-7 


G-7 
G-6 
H-7 
G-6 
G-7 

F-6 
F-7 
G-6 
G-7 
H-7 


E-12 


D-13 
E-13 


Vellore. 
Victoria 
Wadi. . . . 


TALI 

Acapulco. 
Athens . . . 
Baghdad . . 
Chicago. . 
Congo. . . . 

Delhi 

Ghana. . . . 
Greenland 
Istanbul, 

Karachi. . 

La  Paz. . . 
Manila. . . 

Nikki 

Sokoto. . . 
Wankie . . . 


VALENCIA 


Alberta. . . . 

Aragon 

Costa  Brava 
El  Dorado.. 
Las  Cruces. 


Logrono. . . . 
Monte  Carlo 

Niza 

Pasto 

Puntarenas. 


NORTHERN 
TERRITORY* 

Atbara  River 

Basin 

Northwest. . . . 
Subic 

Mountains . . . 


1,539 
1,110 
3,055 


3,447 
3,562 
2,244 
1,635 
1,354 

2,335 
3,289 
3,960 
1,918 
1,450 

2,970 
1,378 
2,014 
842 
3,926 


1,516 
4,227 
3,649 
4,493 
2,401 

3,816 
2,549 
3,503 
2,802 
2,283 


38,552 
33,261 

22,926 


22,596 
28,926 

8,853 


57,345 
116,108 
48,673 
30,402 
57,094 

36,998 
54,177 
66,436 
48,785 
50,844 

49,451 
69,999 
37,526 
276,854 
81,240 


26,075 
18,353 
23,003 
15,718 
22,868 

21,798 
32,721 
43,579 
34,857 
15,281 


130,000 
95,000 

75,000 


1,922 

6,511 

563 


5,239 

10,447 

5,450 

2,978 

11,266 

2,118 
2,667 
3,887 
2,847 
6,674 

2,163 

22,315 

3,448 

173,113 

5,102 


8,430 
3,927 
2,874 
2,222 

4,971 

3,515 
2,736 
12,546 
3,636 
3,148 


23,500 
2,700 

1,100 


20,674 

22,415 

8,290 


52,106 
105,661 
43,223 
27,424 
45,828 

34,880 
51,510 
62,549 
45,938 
44,170 

47,288 
47,684 
34,078 
103,741 
76,138 


17,645 
14,426 
20,129 
13,496 
17,897 

18,283 
29,985 
31,033 
31,221 
12,133 


106,500 
92,300 

73,900 


13.4 

20.2 

2.7 


15.1 
29.7 
19.3 
16.8 
33.8 

14.9 
15.7 
15.8 
24.0 

30.5 

15.9 
34.6 
16.9 
123.2 
19.4 


11.6 
3.4 
5.5 
3.0 
7.5 

4.8 

11.8 

8.9 

11.1 
5.3 


2.8 

2.8 

3.2 


♦Estimated. 

(X)  Not  applicable. 

Source:   1960  Census  of  Population  for  the  Provinces. 
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Exhibit  1-2-21.    NUMBER  OF  AGRICULTURAL  HOLDINGS  BY  SIZE  OF  HOLDING,  BY  PROVINCES:    1961 

(Excludes  Northern  Territory.   Number  of  holdings  rounded  to  nearest  hundred) 
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Exhibit  1-2-34.    POPULATION  BY  AGE  AND  SEX,  URBAN  AND  RURAL:    1960 

(Excludes   Northern  Territory) 


Age   (years) 


Total 


Total 
(000) 


Male 
(000) 


Female 
(000) 


Urban 


Total 
(000) 


Male 
(000) 


Female 
(000) 


Rural 


Total 
(000) 


Male 
(000) 


Female 
(000) 


NUMBER 

Total 

Under  1  year 

1  to  4  years 

5  to  9  years 

10  to  14  years 

15  to  19  years 

20  to   24  years 

25  to  34  years 

35  to  44  years 

45  to  54  years 

55  to  64  years 

65  to   74  years 

75  years  and  over 


PERCENT 
Total 

Under  1  year 

1   to  4  years 

5  to  9  years 

10  to  14  years 

15  to  19  years 

20   to  24  years 

25  to  34  years 

35  to  44  years 

45  to  54  years 

55  to  64  years 

65  to  74  years 

75  years  and  over 


262 
928 

971 
737 

659 
567 
964 

780 

553 
354 
213 


100.0 

3.7 
13.1 
13.7 

10.4 

9-3 
8.0 

13.6 
11.0 

7.8 
5.0 
3.0 
1.4 


3,459 

131 
467 

491 
370 

329 
277 
470 
384 

259 

156 

90 

35 


100.0 
3.8 

13.5 

14.2 
10.7 

9.5 

8.0 

13.6 

11.1 

7.5 
4.5 
2.6 
1.0 


3,629 

131 
462 
480 
367 

331 
290 

493 

396 

293 
199 
123 

65 


100.0 

3.6 

12.7 
13.2 
10.1 

9.1 

8.0 

13.6 

10.9 

8.1 
5.5 
3.4 
1.8 


1,848 

74 
275 
268 
189 

177 
203 
288 
179 


50 
26 
20 


100.0 

4.0 
14.9 
14.5 
10.2 

9.6 
11.0 
15.6 

9.7 

5.3 
2.7 
1.4 
1.1 


983 

37 
132 
120 

92 

94 
108 
190 

105 

56 

27 

13 

9 


100.0 

3.8 
13.4 
12.2 

9.4 

9.6 
11.0 
19.3 
10.7 

5.7 
2.7 
1.3 
0.9 


865 

37 
144 
148 

96 

83 
95 
99 
74 

42 
23 
13 
11 


100.0 

4.2 
16.6 
17.1 
11.1 

9.6 
11.0 

11.4 
8.6 

4.8 
2.7 
1.5 
1.3 


5,239 

188 
653 
703 
549 

482 
364 
676 

600 

455 

304 

I87 

79 


100.0 

3.6 
12.5 
13.4 
10.5 

9.2 

6.9 
12.9 

11.5 

8.7 
5.8 
3.5 
1.5 


2  ,^75 

94 
335 
371 
278 

234 
169 
281 
279 

203 

129 

77 

26 


100.0 

3.8 
13.5 
15.0 

11.2 

9.5 
6.8 

11.3 
11.3 

8.2 
5-2 
3.1 
1.0 


2,764 

94 
318 
332 
271 

248 
195 
395 
322 

251 
175 
110 

53 


100.0 

3.4 
11.5 
12.0 

9.8 

9.0 

7.1 

14.3 

11.6 

9.1 
6.3 
4.0 
1.9 


Source:      1960  Census   of  Population  for  the _Provinces , 


Appendix 

Exhibit  1-2-35.    NUMBER  OF  PERSONS  IN  THE  HOUSEHOLD,  URBAN  AND  RURAL:    1960 

(Excludes  Northern  Territory) 
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Number  of  persons 


Total 


Number 
(000) 


Percent 


Urban 


Number 
(000) 


Percent 


Rural 


Number 
(000) 


Percent 


All  households1. 


1  person. . 

2  persons. 

3  persons. 

4  persons . 

5  persons. 


6  persons . 

7  persons • 

8  persons. 

9  persons . 

10  or  more 


Median. 


1*346 

60 
139 
176 
197 
211 

174 

139 

84 
70 
96 

5.0 


100.0 


4.5 
10.3 

13.1 
14.6 
15.7 

12.9 

10.3 

6.2 

5.2 

7.1 

(X) 


393 

32 
50 
56 
59 


49 

■(■ 
21 

15 
17 

4.5 


100.0 

8.1 
12.7 
14.2 
15.0 
14.7 

12.5 
9.2 
5-3 
3.6 
4.3 

(X) 


25J 

28 

89 

120 

138 

153 

125 

103 

63 

55 

79 

5.2 


100.0 

2.9 

9.3 

12.6 

14.5 

16.1 

13.1 

10.8 

6.6 

5.8 

8.3 

(X) 


(X)  Not  applicable. 

■"■By  definition,  a  household  consists  of  all  the  persons  who  occupy  a  housing  unit;  the  count  of  households,  therefore, 
is  equal  to  the  count  of  occupied  housing  units. 

Source:   1960  Census  of  Population  for  the  Provinces. 
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Exhibit  11-2-1.    CALENDAR  OF  MAJOR  CENSUS  ACTIVITIES 


CALENDAR  OF  MAJOR  CENSUS  ACTIVITIES-1971  CENSUS  OF  AGRICULTURE 


Activity 


Preliminary  planni 
Census  objectiv. 
Initial  decisions 


Assignment  of  specific 

Budget  estimates 

reparatory  work: 
Concepts  and  questtonr 
Table   formats 


Training  program  and  materials  • 
Pretest  design  and  preparation. 

Pretest  enumeration 

Pretest  processing  and  analysis 
Review  and  revision  of  census  plan 

Printing  of  census  forms 

Publicity  plans 


holdif 


npiting  Hat  of  National  Certainty 


Scope  and  design  of  Evaluation  Sur 

Mapping 

Sample  design. 

Compiling  preliminary  1970  Populatto 


Selection  of  sample  PSU's 
Field  operations: 


Establishment  of  PCO's 

Shipment  of  materials  to  field  . 


Recraltment  of  e 


Training  of  District  Supervise 
Preliminary  field  operations 
Training  of  enumerators. 


ratlt 


Clo 


operations 

Evaluation  Survey  enumeration    . 

Review  of  field  sample  selection 

Editing  and  coding 

Card  punching. 

Tabulation    

Review  and  analysis  of  results 

Computation  and  review  of  sampling 

Publication  of  reports: 

Districts 

Provinces... 

Provinces,    total 

Special  reports 

Development  of  Intercensal  program  . 


il 


Appendix 

Exhibit  II-3-4.    EXAMPLE  OF  A  SUBDIVIDED  SAMPLE  PSU 
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Exhibit  II-3-6.   DISTRICT  MAP  SHOWING  SAMPLE  PSU'S  AND  LOCATION  OF  NATIONAL 

CERTAINTY  HOLDINGS 


{PSIT  a  that  were  subdivided  are  those  with  croBS-hatching  on  the  portion  not  to  be  enumerated) 
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Exhibit  III-2-3.    1971  CENSUS  OF  AGRICULTURE  QUESTIONNAIRE  (NORTHERN  TERRITORY),  FORM  A-lb 


Form  A-lb  (AGROSTAN) 
(March    1971) 


IISTRY    OF    ECONOMY 


1971  CENSUS  OF  AGRICULTURE 
QUESTIONNAIRE 

(Northern  Territory) 


IDENTIFICATION 


A.  Name  of  holder 


Code     E.  Distri 


B.  Listing  Book   j  Page     |  Line        C.   Preliminary  A-lb  No. 
(A-2b) »-l  i 


Code     F.   PSU  No 


FOR  OFFICE  USE 


C.  Final  A-lb  No.        H.  Sampling  weight 


ENUMERATOR:      If  the  respondent  is  someone  other  thon   fhe  holder,  be  sore  he  understands  that  the 


id  "your"  refer  to  the  holde 


SectionI-  METHOD  OF  OPERATION  AND  TENURE 


Section II-  TEMPORARY  CROPS  HARVESTED  THIS  YEAR,  1971 


1.    Method  of  operation 

I  □   On  own  account 

2  □    In  partnership  (Skip  to  3) 

3  Q    As  chieftain  (Skip  to  4) 

,   □   As  h.red  manager  (Skip  to  5) 


FOR  OFFICE  USE 


-F 


This  section  concerns  all   temporary  crops  harvested  this  year,    1971. 
These  are  crops  generally  planted  each  year  or  grown  in   rotation. 
Report  crops  for  this  holding  only. 


CEREALS  FOR  GRAIN 


U 


:^;;::;;:: 


\\\\V\\ 


2.(lf  operated  "on  own  account")  This  questionnaire  concerns  the  holding  you 

operated  on  your  own  account.  Include  all  land  on  which  you  grew  crops  or  kept 
livestock  at  any  time  this  year,  1971.  Da  not  consider  public  or  communal  grazin 
land  as  part  of  this  holding. 


Of  the  land  in  this  holding,   was  any   - 

(  1)    Assigned  to  you  by  your  tribe  or  clan? 

(2)  Owned  by  you?    

(3)  Rented  FROM  others  (or  cash  or  a  share  of  the  praduc 

(4)  Used  on  a  squatter  basis?     

(5)  Used  on   some  other  basis?      

(Specify)     


•■      1   □      *□ 
••       '    □      *□ 


Were  any  of  the  following  cereals  harvested  for  GRAIN    - 

6.  Wheat? 

7.  Rice -paddy  (irrigated)?    

8.  Rice  -upland? 

9.  Sorghum? 

10.  Maize   (corn)? 

11.  Barley? 

13     Other  cereols  for  grain -millet,  oats,  rye,  etc.?    

(Specify)  


I30 


□   None  (NO  for  6  through   13) 


LEGUMES  FOR  GRAIN 


(Skip  to  Section  1) 


IT 


3.  (If  operated  "in  partnership")  This  que 
operated  in  partnership  with  others  at 


Who  is  the  principal  partner?     (If  partners  ore  of  equal  rank, 
the  older  (oldest)  as  the  principal  portner.) 


Were  any  of  the  fallowing  legumes  harvested  mainly   for  GRAIN 


4.     Edible  dry   beans   -   excluding   broad 
beans  (see    16)  and  soybeans  (see  27)? 


|      |  Holder  named  in  item  A  of  the  Identific 
(~_7J  Other  person -Name      


i  Section  (Skip  to  3c) 


15.  Edible  dry  peos  -excluding  chick  peas, 
cowpeas,   and  pigeon  peas  (see   16)? 

16.  Other  legumes  for  grain  -  broad  beans,  chick 

(Specify) 


14  □ 

16  □ 


[73  None   (NO  for  14  through  16) 


b.    Name  of  other  person 

I      I  Appears  in  Listing  Book- 
I       I  Does  not  appear  in   Listin 


TUBER,  ROOT,  BULB  CROPS 


Page 


(Skip  to 
Section  2U) 


Include  all    land   on   which   you   and  your   partner!  si   grew   crops   or 

kept  livestock  at  any  time  this  year.   Do  not  consider  public  or  communal 

grazing  land  as  part  of  the  holding. 

Of  the  land   in  this  holding,   was  any  -  „ 

Yes  No 

(1)  Assigned   to  you  by   your   tribe  or   clan?    l[~J~J  2|~J~J 

(2)  Owned  b,  you?     1  □  2  □ 

(3)  Rented  FROM  others  for  cosh  or  a  shore  of  the  production?.  .  I  |       |  2         | 

(4)  Used  on  o  squatter  basis? l[~J  2[~J 

(5)  Used  on  some  other  basis?    1[~J  Z\       | 

(Specify)    


Were  any  of  the  following  tuber,  root,  or  bulb  crops  harvests 
FOR   FOOD  OR  FOR  LIVESTOCK  OR  POULTRY   FEED  - 


17.     Potatoes,    Irish  or   white  -  early   and   lo 
purposes  including  seed? 


19.  Sweet  potatoes  or  yams? 

20.  Dry   onions? 

21.  Other  tuber,  root,  or  bulb  crops  grown  FOR  FEED- 
turnips,    sugar  beets,   etc.  ? 


.tea 

.!!>□ 

■  2°n 
.2ia 


(Specify)  . 


(If  for  food,  report  in  34  or  43) 
□  None   (NO  for  17  through  21) 


CROPS  FOR  INDUSTRIAL  USE 


(Skip  to  Section  JD 


ted 


4.  (If  operated  "os  chieftoin")At  any  time  this  year,    1971, 
did  you  operate  or  supervise  — 


Any   land  on  which  the  crops 


itly  by  your  tribe  Or  clan?  ]   [~]      z\       | 


b.    Any  grazing  land  on  which  the  livestock  was  owned  [ointly 
by   the  tribe  or  clan? 


'□      *□ 

(if    "No"  to  both  4o  and  4b,  Skip  to  Section  XE)  [T 

(If  "Yes  "  in  either  4o  or  4b)  -  This  questionnaire  concerns  the  holding  which 
you  operated  or  supervised  as  o  chieftain.    Do  not  consider  land  assigned  to 
individual  members  of  your  tribe  OS  port  of  this  holding.     (Skip  to  Section  IT) 


Were  any  of  the  following  temporary  crops  ha 
mainly   for   INDUSTRIAL    USE   - 

22.  Sugarcane  cut  for  sugar  or  syrup  or  for  sole  to  mill  s? 

23.  Sugarcane  cut  for  other  purposes  except  for  feed  or  thotching 

24.  Cotton? 

25.  Groundnuts  (peanuts)? 

26.  Tobacco?    

27.  Soybeans? 


for  industrial  use  - 


5.  (If  operated  "os  hired  manager")  This  questionnaire  concerns  the  holding 
you  operated  as  a  hired  manager,  this  year,    1971. 


Who 


r  employer   s  name 

b.  Is  your  employer  - 

1  |  A  private  person? 

2  r~J  A  corporation? 
3[7J  A  cooperative? 

c.  What  is  the  mailing  address  of  your  employ 


4  |       |  A   government  agency? 
□  Other  (Specify) 


(Zone) 


(District) 


(Hamlet  or  chiefdom) 


d.    The  holding  you  manage  for  others  includes  all   land  on  which  you  grew 
crops  or  kept  livestock  for  your  employer.    Do  not  consider  public  or 
communal   grazing  land  as  port  of  this  holding. 

Of  the  land  in  this  holding,   did  your  employer  -                                               yes  Nq 

(1)  Own  any? id  *□ 

(2)  Rent  any    FROM  others  for  cash  or  a  share  of  the  production?  .    I  j       |  Z  I       I 

(3)  Use  any  an  a  squatter  basis? '□  2^ 

(4)  Us.  any  on  some  other  basis? 1  □  2  I       I 

(Specify)    [J 


.220 

.230 

.2sn 
.26^ 

.  27  □ 

.zb-n 


(Specify)    

|      |  None   (NO  for  22  through  28) 


FODDER  CROPS 


Were  any  of  the  following  fodder  crops  harvested  from  land  in  crop  I 

29.  Legumes,  grasses  or  cereals  cut  for  hay? 

30.  Legumes,  grosses  or  cereols  cut  for  green  feed  or  silage?    .... 

31.  Other  fodder  crops  -pumpkin,    sunflower,   cabboge,    sugarcane 


for  feed,   etc 

(Specify) 


.29 

.  30  □ 

.3,n 


|      |  None  (NO  for  29  through  3D 


FIELD  SEEDS 


Were  any  of  the  following  field  seed  crops  harvested  - 
32.    Alfalfa  seed? 


33.    Other  field  seed  crops  -  lespedeza,  orchordgn 
white  clover,  etc.? 


.32  0 
•33Q 


□  None   (NO  for  32  and  33) 
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Exhibit  III-2-3.    1971  CENSUS  OF  AGRICULTURE  QUESTIONNAIRE  (NORTHERN  TERRITORY), 

FORM  A- lb --Continued 


Section  I  -  TEMPORARY  CROPS  HARVESTED  THIS  YEAR,  1971-Continued 


VEGETABLES  AND  MELONS  FOR  HOME  USE 


34.    Were  any    vegetables  o.  ,n?lons  grown  mui:tly  lor  USE  OF   THE 
HOUSEHOLDS  ON  THIS  HOLDING   -   inclujnig   /egeloble* 
ond  melons  from  kitchen  or  home  gordens? 


■3*U1 


□  None  (NO 


VEGETABLES  AND  MELONS  TOR  SALE 


ony   of  the   follow. ng   vegetables  or  melons 
y  for  SALE   FOR  HUMAN  CONSUMPTION 


35.  Cobboge? 

36.  Watermelons? 

37.  Cantaloupes  ar.d  other  melons?    . 

38.  Tomatoes? 

39.  Beans-harvested  green? 

40.  Peas-harvested  green? 

41.  Maize  or  sweetcorn-harvested  gn 
43.  Other  vegetables  mo.nly  lor  sale' 

(Specify)    


37  n 

38  □ 

39  □ 

40  □ 

41  LZ 


|      |  None  (NO  lor  35  through  43) 


FLOWERS,  BULBS,  PLANTS 


44.    Were  ot.y  llowers,  {lower  bulbs,  corms,  or  orno 
plants  grown  (or  sale,   seed,  or  industrial  use? 

(Specify  genera)  c/oss) 


OTHER  TEMPORARY  CROPS 


2 


45.    Were  ony  other  temporary 
crops  generally  planted  i 

(Specify)  


i  harvested  -  tho 


45  □ 


□  None  (NO  for  45) 


Section  IK-  PERMANENT  CROPS 


This  next  section  concerns  permanent  (or  perennial)  crops,  trees,  vines  and 
berries  on  this  holding    this  year,  1971.  Report  for  this  holding  only. 


CITRUS  FRUITS 


Were   there   any   of  the   following  citrus   fruit  trees   on   this   holdir 
es?    


46.  Orange-all  VI 

47.  Lemon?     

48.  Other  citrus  fruit  trees-lime,  tangerine,  grapefruit,  e 

(Specify)    


•««□ 

,47rn 
,  4sn 


]  None  (NO  for  46  through  48 J 


OTHER  CULTIVATED  TREES,  PLANTS,  VINES 


or  nut  trees,  plants,  or  > 
49.    Peach-including  ne 


i  this  holding  . 


50.  Fig? 

51.  Banana?    

52.  Plaintain?  

53.  Pineapple?    

54.  Almond?     

55.  English  walnut? 

56.  Crape?      

57.  Other  cultivated  fru.t  or  nut  trees,  plant! 
apple,   pear,  plum,  cherry,   papaya,  etc' 

(Specify)   

I  None  (NO  for  49  through  57) 


•«!»□ 

■  50  |       | 

■  sin 

.   53  □ 
.   54  □ 

■  =5  □ 

•  sen 


CROPS  FOR  INDUSTRIAL  USE 


any   of  the   lollow.ng  permanent  (pe 
.  mainly  lor  INDUSTRIAL  USE  - 


58.  Coffee?    .. 

59.  Cacao?     .. 

60.  Tea? 

61.  Coconut?  . 

62.  Olive?    ... 

63.  Sisal? 

64.  Rubber?  .. 


65.    Other  permanent  (perennial)  crops  m 

industrial  use-oil  palm,  jute,  mulbe 

(Specify)    


•  ssfZl 

60  n 

.620 

.ssn 


|      |  None  (NO  for  58  through  65) 


Section  HI  -  PERMANENT  CROPS-Continued 


CULTIVATED  SMALL  FRUITS  AND  BERRIES 


other   small  fruits 
(Specify)   


spberries,  blueberries.  Yes 

i  on  this  holding? 66  □ 


|      |  None  (NO  lor  66) 


PERMANENT  GRASSES 


67.     Were   ony   perr 


en   ,1, 

,  (NO  for  t 


•6?  CJ 


NURSERY  TREES,  VINES,  BUSHES 


68.    Wos  ther, 


permanent  (pe 


□  None  (NO  lor  68) 


r  trees,  bushe: 
>  this  holding' 


.  68  □ 


OTHER  PERMANENT  CROPS 


69.     Were   any   other  permanent  (perennial)   crops   grown   on   this  holding?     69  [" 

(Specify)   

□  None  (NO  for  69) 


Section  E-  PRODUCTION  FOR  SPECIFIC  CROPS 


(Refer  to  items  6,  7,  8,  24,  58,  ond  63  to  determine  whether 
the  following  crops  were  grown  this  yeor.) 


Yes 

70.  Wheat □ 

71.  Rice □ 

"•    Cotton    □ 

73.  Coffee    □ 

74.  Sisal    □ 


What  wos  the  total  production  of   (wheat  ] 
[rice]   fcotton]  feoffee]   [>isal  ]  this  year, 
1971?  If  ony  crop  was  shared,  report  total- 
not  just  your  shore. 

(Report  rice  production  in  rough,  cotton 
proouction  in    "lint  ond  seed.") 


/10 


/io 


/io 


/io 


Othe 
(Speci 


□  None  (NO  for  70  through  74) 


Section  Y-  INTERPLANTED  AND  SUCCESSION  CROPS,  THIS  YEAR,  1971 


This  next  section  refers  to  all  temporary  and 
permanent  crops  you  grew    this  year,  1971. 


INTERPLANTED  CROPS 


75.    Were  any  of  the  crops  interplanted  (intercropped)  this  ye 
that  is  grown  on  the  same  land  at  the  same  time? 


SUCCESSION  CROPS 


76.    Were  any  of  the  crops  horve 

1    QYes 


land   this   year? 


2    QNo 


Section  51  -  IRRIGATION  AND  FERTILIZATION,  THIS  YEAR,  1971 


This  next  section  also  refers  to  all  temporary  and 
permanent  crops  you  grew    this  year,  1971. 


77.    Were  ony  of  the  crops   irrigated  this  year?  Irrigation  refers  to  supplying  land 
with  watet  other  thon  rainfall.  When  water  is  collected  and  btaught  to  the 
land  later,   consider  this  as  irrigation.  Also,   include  purposeful  flooding  of 
land  by  rivers,  streams,  ar  dams. 

1    QYes  2    □  No  (Skip  to  78) 

a.  Which  crops  were  irrigated? 


78.  Were  any  chemical  or  other  commercial  fertilisers  applied  to  any  cropland 
this  yeor?  Include  rock  phosphate  but  do  not  include  lime  ot  gypsum.  Also, 
do  not  include  farmyard  manure  or  crop  residue. 

I    □  Yes  2    □  No  (Skip  to  Section  ED) 

a.    Which  crops  were  fertilized? 
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Exhibit  III-2-3.    1971  CENSUS  OF  AGRICULTURE  QUESTIONNAIRE  (NORTHERN  TERRITORY), 

FORM  A-lb — Continued 


SectionYH-  LIVESTOCK  AND  POULTRY 


Section  Vm  -  SALE  OF  AGRICULTURAL  PRODUCTS 


This  next  section  concerns  livestock  and  poultry.  Report  all  livestock  and 
poultry  on  this  holding,  whether  owned  by  you,  by  your  landlord,  or  by  some- 
one else.  Include  livestock  kept  on  communal  or  public  grazing  land  if  owned 
!  by  you  J     [  jointly  by  your  tribe  or  clan  J     I  by  your  employer  I  .  If  livestock  is 
associated  with  more  than  one  holding,  report  animals  that  are  kept  on  this 
holding  during  the  night. 


This  next  section  refers  to  the  sale  of  agricultural  products  of  this  holding 
during  1971.  Agricultural  products  include  crops,  livestock,  livestock  prod- 
ucts, poultry,  poultry  products,  forest  products,  fish,  etc. 


ANIMALS  ON  HOLDING 


Are  there  any  of  the  following   r 

holding  or  on  communal  gracing  land  — 

79.  Horses  -  all  ages? 

80.  Mule 


-   all  age 

81.  Asses  -   all  age 

82.  Cattle  and  colv, 


"(1)  Cows  and  heifer: 

should 


(2)  All  other  cattle 


run 

Yes 

□ 

□ 

□ 
□ 


!3.    Sheep  and   lambs 


(Total  ol         (    (])  Under   1    year  old? 

(l)and(2) 

should  < 

eqvol  83)  (2)  1   year  or  older'    . 


84.    Goats  and  kids  -   all  ages? 

(Total  of  (  (1)  Under  1    year  old? 

0)  ona  Q)  J 

should  S 

equo!84)  I     <2)  1    year  or  older?    . 


□ 


□ 


85.    Hogs  and  pigs  -   all   ages? 

(Total  oi         f    (1)  Under  6  months  old? 

(I)  ond  (2)  J 

should  \ 

equo!85)        I     (2)  6  months  or  older'  . 


this  holding  -  camels,  buffaloes,   llamas 
etc.  -  all  ages? 


□ 


□ 


( Specify)    

|      |  None  (NO  for  79  through  86.     If  NONE,  slip 


ANIMALS  MILKED 


Were  any  of  the  fallowing  animals  milked 

87.  Caws  (cattle)' 

88.  Goats? 

89.  Other  animals  -  sheep,  cornels,  buffol 

(Specify)    


pa.  Ho 

Yes    

□ 


LZ 


|      |  None  (NO  for  87  througn  89) 


ANIMALS  FOR  TRANSPORT  OR  DRAFT  PURPOSES 


Were  any  of  the  following  animals  now  on  this  holding 
for  transport  or  draft  purposes  at  any  time  this  year  — 
that  is,  for  carrying   loads,  pulling  or  drawing   loads,  plo 


t   is,   for  carrying    loads,   pulling   or 
sshing,  grinding,   lifting  water,  etc 


90.  Cottle,  including  oxen? 

91.  Buffaloes' 

92.  Other  animals  -  camels,  elephants,   llorr 

(Specify)    


I      I  None  (NO  for  90  through  92) 


POULTRY  ON  HOLDING 


93.    Chickens  -  oil 


(I)  ond(2) 
should 
equol  93) 


(2)  All  chickens 

4  months  or  olde 


94.    Other  poultry  -  turkeys,  ducks,  geese,  i 
all  ages? 


Cj 


□ 


(Specify) 


□  None  (NO  for  93  and  94) 


95.    Did  the  totol  cash  received  during   1971   from  the  sale  ol  agricultural  prcdu 
of  this  holding  amount  to  - 

1  □  5100  or  more' 

2  □  $25  to  599? 

3  □  Less  than  $25?  (Slip  !o  Sect/or  K) 


OlOops' 1    □YesfSM 

2    QNo 


(2)   Livestock   and   poultry   - 
including  livestock  produc 
and   poultry   products?  .... 


(3)  Other  agricultural  produ 


1  □  Yes  (Skip  to  S 

2  QN. 

1  O  Yes 

2  □  No  (Skip  to  Secrion  K) 


E) 


the  other  agricultural   produc 


SectionK-  AGRICULTURAL  POWER,  EQUIPMENT, 
AND  TRANSPORT  FACILITIES 


These  next  questions  are  about  power,  equipment,  and  transport  facilities. 


96.    This  year,  was  there  electri 

a.  In  the  dwelling? 

b.  Elsewhere   on  the  holding 


«    □         2    □ 

1     □         2    □ 


I'. 


97.    Was   the  agricultural   work   in   the   fields   or   elsewhe 
this   holding   done   - 


a.  In   part   by   animal   power? 

b.  In-part  by  mechanical  power  —  that  is,  by  machir 
propelled  or  operated  by  petrol,  oil,  gos,  steom, 
wind,  electricity? 


'     □         2 


(If  NO  in  both  97o  ond  97b,  skip  lo  09) 


98.     This   year,   was   any   of  rhe   following   equipment   used   fo 
agricultural  work  on  this  holding  - 


a.  Animal  drawn  plows 

b.  Tractor-operoted  plo 


d.  Threshers  (engine-operated)? 


•  □    *  □ 
1  □    ^  □ 

'     □         *    □ 


99.    How  many  of  the  following  vehicle 
kept  on  this  holding  - 


a.  Jeeps,  stat 

b.  Automobile: 
C.    Animol-d.av 


□ 
□ 
□ 


Section  X- CHARACTERISTICS  OF  HOLDER'S  HOUSEHOLD 


100.  These  next  questions  relate  to  the  people  who  live  in  your  housing  unil 
are  the  names,  relationship,  and  ages  of  all  the  persons  who  live  in  yoi 
housing  unit?  Include  infants,  hired  workers,  ond  other  persons  who  usi 
live  with  you.     (List  holder  on   ihe  first  line.) 


11, 


OFFICE  USE 


d.  Age 
of  last 
birthda 


134 


Appendix 


Exhibit  III-2-3.    1971  CENSUS  OF  AGRICULTURE  QUESTIONNAIRE  (NORTHERN  TERRITORY), 

FORM  A- lb --Continued 


Section  XI  -  MEASUREMENT  OF  CROP  AREA 


Section M-  ENUMERATOR'S  RECORD 


RECORD  OF  VISITS 


1  □  Holding  in  sample  for  area  measurement  (circled  A-lb  number 

in  column   U  or  15  or  A-2b  Listing  Book) 

2  □  Holding  not  in  sample  for  area  measurement(A-1b  number  NOT 

circled  in  column   U  or  15  of  A-2b  Listing  Book. 
Skip  to  Section  JTTJ .) 


LAND  AREA  IN  CROPS 


(Refer  fo  Sections  II  and  HI  and  del: 
the  following  crops  were  grown  this 


Ine  whethe 
ir,   1971.) 


of 


102.   Wheat    □  □ 

'03-   R.ce   □  □ 

104.  Cotton □  □ 

105.  Coffee □  □ 

106.S,sal    □  □ 

107.  Other  crops □  □ 


(II  NO  'or   102  through   107, 
slip  to  Section  XU) 


TOTAL- 


IS umber  of 
separate  plots 


FOR 
OFFICE  USE 


1  QHolde 

2  Owife 


□  Other  (Specify) 


□  Nonintervi 
(Explain) 


Now,  I  wont  you  to  show  me  all  the  land  on  which  you  grew  crops  this  year.  We  will  start  with  the  field  or  tract  which  is 

closest  to  here  and  measure  all  separate  fields  or  tracts  on  which  crops  were  grown.  We  will  also  measure  the  specific 

area  (s)  on  which    [wheat]     [rice]     [cotton]    [coffee]     [sisal]    was  (were)  grown.  (Moke  sketch  be/ow;  then  go  ro  Section  ZH.) 


Respondent  definitely  len 


SPECIAL  CASES 

ch  land  was  in  each  cr 


nd  has   10  or  more  hectares  of  i 


□  Hectare  □  Tanbo  □  Man2ana  |      |  Other  fSpecify) 


act.  field, 
plot  No. 


|/10 


!  /io 


|/io 


i    '" i ■    ■" i ' '  ■"■ '  -  '"i 

Verify  with  respondent  that  this  was  the  approximate  total 
II  not,  review  figures  and  correct  accordingly.    Co  to  Secti 


Othe 


il  o(  fond  in  crops 


nXII.) 


Page  4 


Appendix 

Exhibit  VI-2-4.    PSU  CONTROL  RECORD  FOR  MAP  PREPARATION,  FORM  A-271 


135 


SHFFT. 

OP                                    «"CCTS 

Form  A-271  (AGROSTAN) 
(June  1970) 

MINISTRY   OF  ECONOMY 
NATIONAL    STATISTICAL    OFFICE 

PSU  CONTROL  RECORD 
FOR  MAP  PREPARATION 

1971  CENSUS  OF  AGRICULTURE 

A.  Province  or  zone              ICode 
1 

1 

B.  District                                |Code 
1 

i 

C.  PSU 

Fron 

numbers 
i                      _   to 

D.  Date  request  sent  to 
Geography  Division 

E.  Map  transmittal 

I^JSent  by 

Geography  Division 

j      |  Received  in 

Agriculture  Division 

PSU  maps  (3  sets) 

District  maps  (8  copies) 

Initials             Date 

Initials             Date 

1 
1 

Line 
No. 

(1) 

Subdistrict  No. 
(2) 

EA  No. 

(3)               \ 

PSU  No. 

(4) 

PSU  map  prepared 
(Date) 

(5) 

Comments 
(6) 

1 

2 

3 

4 

5 

6 

7 

8 

9 

10 

11 

12 

13 

14 

15 

16 

17 

18 

19 

20 

21 

22 

23 

24 

25 

26 

27 

28 

29 

30 

BUFF  -=  Geography  Division 
WHITE   —  Agriculture  Sampling 


BLUE  -  Central  Control 


\ 
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